Pigment production by cells having introduced cytochrome P450 sequences.
FIELD OF THE INVENTION 

The present invention relates generally to a method for producing a pigment in or by
biological material including plant, animal and microbial cells. Particularly useful 
biological material contemplated herein includes microbial cells, plant cells such as plant 
tissue and organs including petals, flowers, stems, leaves and seeds, plant fibrous material 
such as cotton, and non-cellular plant material. More particularly, the present invention is
directed to the use of a cytochrome P450 protein or a derivative or homologue thereof
together with one or more homologous or heterologous associated proteins to facilitate the 
conversion of a substrate, when available to or in a cell, to a coloured product. The 
coloured product makes the cell in which it is produced, as well as surrounding tissue or 
medium to which the product leaks and tissue and plant material connected by a vascular
system to the cell in which the product is formed, change colour. The present invention is
further directed to microorganisms, transgenic animal cells and transgenic plants, parts of 
plants including cells of plants and progeny of said plants which are capable of expressing 
genetic material encoding a cytochrome P450 protein or a derivative or homologue thereof 
and one or more homologous or heterologous associated proteins wherein microorganisms,
transgenic animal cells and plant cells expressing such genetic material or surrounding
cells or cells and plant material connected at the vascular level to said first mentioned cells 
undergo a colour change relative to cells or plant tissue which do not express this genetic 
material. The present invention further provides parts of plants including cells, tissues and 
organs such as petals, flowers (e.g. cut or severed flowers), stems, leaves and seeds,
fibrous material such as cotton and non-cellular material which have undergone a colour
change resulting from a pigment whose production is facilitated by the presence of a 
cytochrome P450 protein or a derivative or homologue thereof and one or more 
homologous or heterologous associated proteins. The method of the present invention is 
predicated in part on the oxidation of indole or related molecules by the cytochrome P450
protein and a homologous or heterologous associated protein to indoxyl and oxidized
indoxyl or related molecules which alone or in combination with other molecules can form
a range of pigments including indigo and indirubin. The production of pigment in
microbial cells, animal cells or plant cells is useful inter alia in the ornamental flower 
industry, as commercial tags and as sources of pigments for the dye industry. The pigment 
produced is also useful as a marker for transformation of microbial, plant and animal cells. 


BACKGROUND OF THE INVENTION 

Reference to any prior art in this specification is not, and should not be taken as, an 
acknowledgment or any form of suggestion that this prior art forms part of the common 
general knowledge in Australia or any other country.

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Nucleotide and amino acid sequences are referred to 
by a sequence identifier, i.e. <400>1, <400>2, etc. A sequence listing is provided after the 
claims.

The increasing sophistication of recombinant DNA technology is greatly facilitating 
research and development in the chemical, agricultural and horticultural industries. Of 
particular importance is the use of genetic techniques to introduce useful phenotypes in 
plants such as disease resistance, frost resistance and altered flower colour.

The ability to alter flower colour has been a major focus of the horticultural industry. The 
use of genetic manipulation has the potential to vastly increase the range of flower colour 
which has so far been achieved using conventional breeding practices. One approach has
been to manipulate the flow of metabolites through the flavonoid biosynthetic pathway.
This pathway results in the production of anthocyanins which are glycosylated derivatives 
of cyanidin, delphinidin, petunidin, peonidin, malvidin and pelargonidin. The flavonoid 
pathway is well established and the pattern of hydroxylation of the B-ring of anthocyanins 
plays a key role in determining petal colour (see International Patent Application No.
PCT/AU94/00265 [WO 94/28140]).

In order to fully exploit the potential spectrum of colour change, other approaches to
manipulating plant colour need to be considered. 

Cytochrome P450 proteins are a superfamily of haemoproteins which catalyze the
oxidation of a diverse range of chemicals in a wide variety of organisms. Cytochrome
P450 proteins are involved, for example, in clearance of xenobiotic chemicals and
biosynthesis of hormones and other signalling molecules, and participate in homeostatic
mechanisms. Cytochrome P450-dependent monooxygenases require, for activity,
associated proteins which catalyze the transfer of electrons from, for example, a
co-enzyme to a prosthetic haem group on the cytochrome P450 molecule. Generally, the
associated protein is a reductase such as a co-enzyme-dependent cytochrome P450
reductase. An associated protein may also have a role in facilitating interaction between a
cytochrome P450 and a reductase. An example of such a molecule is cytochrome b5 (De
Vetten et al, 1999). This may be used with a cytochrome P450 protein or in addition to
another associated protein.

In work leading up to the present invention, the inventors observed that microorganisms 
engineered to express genetic material encoding a cytochrome P450 protein turned 
themselves and their culture medium blue. In accordance with the present invention, it is 
proposed to exploit this phenomenon to generate pigment production in microbial cells,
animal cells and in plant cells and tissue and other plant material.

SUMMARY OF THE INVENTION

Throughout this specification, unless the context requires otherwise, the word "comprise", 
or variations such as "comprises" or "comprising", will be understood to imply the 
inclusion of a stated element or integer or group of elements or integers but not the
exclusion of any other element or integer or group of elements or integers. 

One aspect of the present invention is directed to the use of genetic material encoding a 
cytochrome P450 protein or a functional derivative or homologue thereof in the generation 
of a genetically modified cell, which cell has the capacity to produce a pigment in the
presence of indole or a precursor, analogue or derivative thereof upon expression of said 
genetic material. 

Another aspect of the present invention provides for a use of genetic material encoding a 
cytochrome P450 protein or a derivative or homologue thereof in the generation of a 
genetically modified cell, which cell has the capacity to produce a pigment in the presence 
of indole or a precursor, analogue or derivative thereof upon expression of said genetic 
material wherein the genetic material, encoding the cytochrome P450 protein or its 
functional derivative or homologue, is co-expressed with genetic material encoding an 
associated protein. 

A further aspect of the present invention is directed to the use of genetic material encoding 
a mammalian or microbial cytochrome P450 monooxygenase or a functional derivative or 
homologue thereof in the generation of a genetically modified cell, which cell has the 
capacity to produce a pigment in the presence of indole or a precursor, analogue or
derivative thereof and upon expression of said genetic material wherein said genetic
material encoding the cytochrome P450 monooxygenase or its functional derivative or
homologue is co-expressed with genetic material encoding an associated protein selected
from a co-enzyme-dependent cytochrome P450 reductase and/or a cytochrome b5 or a
functional derivative or homologue of either molecule.

Still another aspect of the present invention provides a method of inducing a colour change
in a plant or part of a plant said method comprising introducing into plant cells a genetic 
construct encoding a cytochrome P450 protein or a functional derivative or homologue 
thereof, regenerating a plant from said plant cells and subjecting said plant to appropriate 
conditions sufficient to permit expression of a nucleotide sequence on said genetic
construct which encodes said cytochrome P450 protein such that in the presence of indole 
or a precursor, analogue or derivative thereof said cytochrome P450 protein facilitates the 
conversion of indole or its precursor, analogue or derivative to an intermediate which is 
capable of oxidation to a pigment. 


Yet another aspect of the present invention provides a genetically modified cell or 
multicellular animal or plant or progeny thereof or parts of said transgenic plant or animal 
wherein said transgenic plant or animal comprises expressible genetic material encoding a 
microbial or mammalian cytochrome P450 protein. 


Even yet another aspect of the present invention provides a genetically modified cell or 
multicellular animal or plant or progeny or parts thereof comprising expressible genetic 
material encoding a microbial or mammalian cytochrome P450 monooxygenase such that 
upon expression of said genetic material and in the presence of an associated protein and 
indole or a precursor, analogue or derivative thereof, the indole or its precursor, analogue
or derivative is converted to an intermediate which is capable of oxidation to a pigment. 

A further aspect of the present invention is directed to a transgenic plant or progeny or 
parts thereof comprising expressible genetic material encoding a microbial or mammalian 
cytochrome P450 monooxygenase such that upon expression of said genetic material and
in the presence of an associated protein and indole or a precursor, analogue or derivative
thereof, the indole or its precursor, analogue or derivative is converted to an intermediate
which is capable of oxidation to a pigment. 

Still another aspect of the present invention provides a transfected or transformed cell,
tissue, organ or non-cellular material which contains or is capable of producing
cytochrome P450 protein or a functional derivative or homologue thereof.

Yet another aspect of the present invention is directed to flowers (e.g. cut or severed
flowers), parts of plants, fibrous material from plants (e.g. cotton), non-cellular material
and reproductive portions of plants (e.g. cuttings, seeds, pollen, callus) which are coloured
or have the capacity to be coloured in the presence of indole or a precursor thereof due to
the presence of cytochrome P450 protein (e.g. monooxygenase) and an associated protein
(e.g. NADPH-dependent cytochrome P450 reductase or cytochrome b5).

Even yet another aspect of the present invention provides for the use of a genetic sequence
comprising a nucleotide sequence expressible in plant cells which encodes a cytochrome 
P450 monooxygenase in the generation of a transgenic plant or transformed cells of a plant 
which, upon expression of said nucleotide sequence, and in the presence of indole or a 
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the cytochrome P450 monooxygenase comprises the amino acid
sequence (F/P/W/L/Y)(G/S/C/N/H/E)X(G/D/A/E)X(R/H/S/K/T)XCX1(G/A) wherein X is
any amino acid and X1 is selected from V, M, A, L, I, P, F or T and wherein
residues in parentheses represent alternatives at a single position.


Still a further aspect of the present invention provides for the use of a genetic sequence
comprising a nucleotide sequence expressible in plant cells which encodes a cytochrome P450
monooxygenase in the generation of a transgenic plant or transformed cells of a plant
which, upon expression of said nucleotide sequence, and in the presence of indole or a
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the cytochrome P450 monooxygenase comprises the amino acid
sequence (F/L)(G/S)XGX(R/H)XCX1(G/A) where X is any amino acid and X1 is selected
from V, M, A, L, I, P, F or T and where residues in parentheses represent alternatives at a
single position.

Yet another aspect of the present invention is directed to the use of a genetic sequence comprising a
nucleotide sequence expressible in plant cells which encodes a cytochrome P450 
monooxygenase in the generation of a transgenic plant or transformed cells of a plant 
which, upon expression of said nucleotide sequence, and in the presence of indole or a 
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a 
pigment wherein the nucleotide sequence is selected from: 

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>8,
<400>10 or <400>12;

(ii) a nucleotide sequence defined by <400>7, <400>9 or <400>11;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i)
or (ii);

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
encoding the amino acid sequence defined in <400>2, <400>4 or <400>6; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
defined in <400>1, <400>3 or <400>5; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 
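By way of illustration only, the "at least 50% similarity" criterion in items (iii) to (v) can be sketched as a comparison of two pre-aligned nucleotide sequences. The sketch below assumes that similarity is scored as simple percent identity over aligned columns, which is an assumption made here for illustration and not a definition taken from this section; the function names and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences ('-' marks a gap).

    Assumption: identity over aligned columns stands in for "similarity".
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # a column that is a gap in both rows is ignored
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 50.0) -> bool:
    """True when the percent identity is at least the threshold (default 50%)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Hypothetical sequences, not taken from the sequence listing.
print(percent_identity("ATGC", "ATGA"))
print(meets_threshold("ATGC", "TTTT"))
```

In practice the alignment itself would first be produced by an alignment program; the sketch covers only the scoring step.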


Even yet another aspect of the present invention provides for the use of a genetic sequence
comprising a nucleotide sequence expressible in plant cells which encodes a cytochrome 
P450 monooxygenase in the generation of a transgenic plant or transformed cells of a plant 
which, upon expression of said nucleotide sequence, and in the presence of indole or a 
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the genetic sequence further comprises a nucleotide sequence encoding
an NPR wherein the nucleotide sequence is selected from:- 

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>14 and
<400>16;

(ii) a nucleotide sequence defined by <400>13 and <400>15;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i)
or (ii);

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
encoding the amino acid sequence defined in <400>14 and <400>16; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
defined in <400>13 and <400>15; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 


A further aspect of the present invention provides for the use of a genetic sequence 
comprising a nucleotide sequence expressible in plant cells which encodes a cytochrome 
P450 monooxygenase in the generation of a transgenic plant or transformed cells of a plant 
which, upon expression of said nucleotide sequence, and in the presence of indole or a 
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a 
pigment wherein the nucleotide sequence is selected from: 

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>21;

(ii) a nucleotide sequence defined by <400>20;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i)
or (ii);

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
encoding the amino acid sequence defined in <400>21;

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>20; or 


(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

Still another aspect of the present invention provides for the use of a
genetic sequence comprising a nucleotide sequence expressible in plant cells which
encodes a cytochrome P450 monooxygenase in the generation of a transgenic plant or
transformed cells of a plant which, upon expression of said nucleotide sequence, and in the
presence of indole or a precursor, analogue and/or derivative thereof, have the capacity to
convert indole or a precursor, analogue or derivative thereof to an intermediate which is
oxidizable to a pigment wherein the genetic sequence further comprises a nucleotide
sequence encoding a bacterial flavodoxin wherein the nucleotide sequence is selected
from: 

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>23; 

(ii) a nucleotide sequence defined by <400>22;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i)
or (ii);

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
encoding the amino acid sequence defined in <400>23; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>22; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

Yet another aspect of the present invention further contemplates the use of plant or plant 
material as a source of pigment for use, for example, in the dye or cosmetics industry or as 
commercial tags. 

Even yet another aspect of the present invention further provides a genetic construct 
comprising a cistron encoding a microbial or mammalian cytochrome P450 protein for use, 
and more particularly, when used, in the generation of a transformed plant cell such that 
upon expression of said cistron, the cytochrome P450 facilitates conversion of indole or a 
precursor thereof to a molecule oxidizable to a pigment. 

In a further aspect of the invention, the cytochrome P450 monooxygenase comprises the 
amino acid sequence (F/P/W/L/Y)(G/S/C/N/H/E)X(G/D/A/E)X(R/H/S/K/T)XCX1(G/A)
wherein X is any amino acid and X1 is selected from V, M, A, L, I, P, F or T and wherein
residues in parentheses represent alternatives at a single position.

In still another aspect of the invention, the cytochrome P450 monooxygenase comprises 
the amino acid sequence (F/L)(G/S)XGX(R/H)XCX1(G/A) where X is any amino acid and
X1 is selected from V, M, A, L, I, P, F or T and where residues in parentheses represent
alternatives at a single position. 
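As an illustrative sketch only, the narrower motif (F/L)(G/S)XGX(R/H)XCX1(G/A) can be written as a regular expression and used to scan a protein sequence. The function name and the test peptide below are hypothetical and are not taken from the sequence listing; X is treated as any single character, and X1 as one of V, M, A, L, I, P, F or T, as stated above.

```python
import re

# Narrower motif from the text: (F/L)(G/S)XGX(R/H)XCX1(G/A)
# X  = any amino acid (any single character here)
# X1 = one of V, M, A, L, I, P, F or T
MOTIF = re.compile(r"[FL][GS].G.[RH].C[VMALIPFT][GA]")


def find_motif(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each non-overlapping motif hit."""
    seq = protein.upper()
    return [(m.start(), m.group()) for m in MOTIF.finditer(seq)]


# Hypothetical test peptide, used only to exercise the pattern.
example = "MKTFGAGSRLCVGAAA"
print(find_motif(example))
```

A sequence lacking the conserved cysteine in the required context returns an empty list, so the function can serve as a coarse first filter before any functional assessment.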

In yet another aspect of the present invention, the nucleotide sequence encoding the 
cytochrome P450 monooxygenase is selected from:-

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>8,
<400>10, <400>12 or <400>21; 

(ii) a nucleotide sequence defined by <400>7, <400>9, <400>11 or <400>20;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
encoding the amino acid sequence defined in <400>2, <400>4, <400>6 or 
<400>21; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>1, <400>3, <400>5 or <400>20; or

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

In even yet another aspect of the present invention, a particularly useful associated protein
is human NPR (hNPR) encoded by:-

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>16;

(ii) a nucleotide sequence defined by <400>15;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 



(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
encoding the amino acid sequence defined in <400>14;

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
defined in <400>13; or

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

In a further aspect of the present invention, another useful associated protein is a bacterial 
flavodoxin encoded by:- 


(i) a nucleotide sequence encoding the amino acid sequence defined in <400>23; 

(ii) a nucleotide sequence defined by <400>22; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i)
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
encoding the amino acid sequence defined in <400>23; 


(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>22; or 



(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v).

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a diagrammatic representation of the strategy used for generation of 
monocistronic and bicistronic constructs for co-expression of cytochrome P450 forms and 
hNPR. Solid bars represent P450 coding sequences; shading represents hNPR coding
sequence; hatched bars represent other inserts, for example, linkers. 

Figure 2 is a photographic representation of blue pigments in P450 2A6 cultures. (A) 
Cultures (2 ml) were prepared with (+) and without (-) isopropyl-β-D-thiogalactoside
[IPTG] induction in TB media (48 hr at 29°C) with vigorous shaking. (B) Expression was
induced by IPTG with (+) or without (-) the addition of 1.0 mM indole to the culture.

Figure 3 is a photographic representation of thin layer chromatography (TLC) separation 
of pigments formed by E. coli cells expressing DNA encoding cytochrome P450 2A6. The 
pigments (1,2,3,4) were separated by TLC as described in
Example 3. The band ("O") at the TLC origin was subsequently shown by visible, 1H NMR
and mass spectrometry to be indigo, identical to compound 4 but not eluted due to the 
insoluble nature of the material. 

Figure 4 is a photographic representation of spectra of pigments formed by E. coli cells
expressing DNA encoding cytochrome P450 monooxygenase 2A6. Pigments (1,2,3,4) were
separated by TLC as described and spectra of pigments 2 and 4 were recorded in CHCl3.
The spectrum of pigment 2 is characteristic of indirubin whereas pigment 4 corresponds to 
indigo. 


Figure 5 is a graphical representation of 1H NMR spectra of pigment 2 (indirubin) produced
by cytochrome P450 monooxygenase 2A6 (Figure 3), at 400 MHz in d6-Me2SO.
Assignments are indicated. The connectivities and 2-D spectra are shown on the structure.

Figure 6 is a graphical representation of 1H NMR spectra of pigment 4 (indigo) produced
by cytochrome P450 monooxygenase 2A6 (Figure 3), at 400 MHz in d6-Me2SO.
Assignments are indicated. The connectivities and 2-D spectra are shown on the structure. 

Figure 7 is a photographic representation of TLC showing in vitro production of blue
pigments by bacterial membranes containing human cytochrome P450 protein and human 
NADPH-dependent cytochrome P450 reductase (hNPR). Each cytochrome P450 form is 
indicated. The arrow indicates the position of indigo. In the case of incubations containing 
P450 2A6, the indigo produced remained largely at the origin due to insolubility, a 
phenomenon also seen in Figure 3.

Figure 8 is a graphical representation showing relative in vitro production of blue 
pigments by bacterial membranes containing human cytochrome P450 protein and human 
NADPH-dependent cytochrome P450 reductase (hNPR). Each cytochrome P450 form is 
indicated. For cytochrome P450 monooxygenase 2C9, the various allelic variants are
indicated as "*1", "*2", "*3". "HL16" is a human liver microsomal sample. In all cases,
0.8 mM indole was incubated with the membranes (0.2 µM P450) for 30 min at 37°C and
dye production was analyzed by measuring absorbance at 665 nm. 

Figure 9 is a representation of a timecourse showing the production of various metabolites
from indole over time in incubations with recombinant P450 2A6 and hNPR in bacterial
membranes. Experiments were performed as described in Example 6. A615 indicates total
absorbance at 615 nm, indicating pigment production. tR 3.3 indicates a metabolite with
retention time of 3.3 min under the HPLC conditions used. Other metabolites are indicated
on the figure.

Figure 10 is a representation of the postulated reactions in cytochrome P450 
monooxygenase-catalyzed pigment formation in E. coli. 

Figure 11 is a diagrammatic representation showing postulated chemical mechanisms for
cytochrome P450 monooxygenase-catalyzed pigment formation in E. coli.

Figure 12 is a diagrammatic representation showing the general strategy for subcloning
cytochrome P450 and hNPR cDNAs into vectors for expression in plants. In the cases of
P450 2A6 and hNPR, which contain an internal SacI site, the final ligation of the ribosome
binding site (RBS) coding sequence and nos terminator into p2.5GEM-II was done in two
separate steps. 

A summary of the sequence identifiers used in the subject specification is shown in Table 1.

TABLE 1
Summary of Sequence Identifiers

SEQUENCE IDENTIFIER    DESCRIPTION

<400>1                 Nucleotide sequence encoding human derived cytochrome
                       P450 2A6 monooxygenase for expression in bacterial cells

<400>2                 Amino acid sequence of P450 2A6 encoded by <400>1

<400>3                 Nucleotide sequence encoding human derived cytochrome
                       P450 2E1 monooxygenase for expression in bacterial cells

<400>4                 Amino acid sequence of P450 2E1 encoded by <400>3

<400>5                 Nucleotide sequence encoding human derived cytochrome
                       P450 2C19 monooxygenase for expression in bacterial cells

<400>6                 Amino acid sequence of P450 2C19 encoded by <400>5

<400>7                 Nucleotide sequence encoding human derived cytochrome
                       P450 2A6 monooxygenase for expression in eukaryotic cells

<400>8                 Amino acid sequence of P450 2A6 encoded by <400>7

<400>9                 Nucleotide sequence encoding human derived cytochrome
                       P450 2E1 monooxygenase for expression in eukaryotic cells

<400>10                Amino acid sequence of P450 2E1 encoded by <400>9

<400>11                Nucleotide sequence encoding human derived cytochrome
                       P450 2C19 monooxygenase for expression in eukaryotic cells

<400>12                Amino acid sequence of P450 2C19 encoded by <400>11

<400>13                Nucleotide sequence encoding human derived cytochrome
                       P450 reductase (NPR) for expression in bacterial cells

<400>14                Amino acid sequence of NPR encoded by <400>13

<400>15                Nucleotide sequence of human derived NPR for
                       expression in eukaryotic cells

<400>16                Amino acid sequence of NPR encoded by <400>15

<400>17                Nucleotide sequence of genetic construct for bacterial
                       production of P450 2A6

<400>18                Nucleotide sequence of genetic construct for bacterial
                       production of P450 2E1

<400>19                Nucleotide sequence of genetic construct for bacterial
                       production of P450 2C19

<400>20                Nucleotide sequence encoding P450 cin, a cytochrome
                       P450 monooxygenase from Citrobacter braakii

<400>21                Amino acid sequence of P450 cin encoded by <400>20

<400>22                Nucleotide sequence encoding flavodoxin

<400>23                Amino acid sequence of flavodoxin encoded by <400>22

A summary of abbreviations used in the subject specification is provided in Table 2.

TABLE 2
Abbreviations

ABBREVIATION           DESCRIPTION

P450 2A6               Cytochrome P450 2A6 monooxygenase

P450 2E1               Cytochrome P450 2E1 monooxygenase

P450 2C19              Cytochrome P450 2C19 monooxygenase

P450 cin               Cytochrome P450 cin monooxygenase from Citrobacter
                       braakii

IPTG                   Isopropyl-β-D-thiogalactoside

TLC                    Thin layer chromatography

NPR                    NADPH-cytochrome P450 reductase

hNPR                   Human-derived NADPH-cytochrome P450 reductase

Associated protein     A protein which catalyzes transfer of electrons from a
                       co-enzyme to prosthetic haem group on cytochrome P450
                       protein or facilitates interaction of cytochrome P450
                       protein and a cytochrome P450 reductase

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Accordingly, one aspect of the present invention is directed to the use of genetic material 
encoding a cytochrome P450 protein or a functional derivative or homologue thereof in the 
generation of a genetically modified cell, which cell has the capacity to produce a pigment
in the presence of indole or a precursor, analogue or derivative thereof upon expression of 
said genetic material. 

More particularly, the present invention provides for the use of genetic material encoding a 
cytochrome P450 protein or a functional derivative or homologue thereof in the generation
of a transgenic plant which transgenic plant has the capacity to produce a pigment in all or 
part of its tissue or non-cellular material in the presence of indole or a precursor, analogue 
or derivative thereof upon expression of said genetic material. 

The cytochrome P450 protein is preferably a cytochrome P450 monooxygenase and
requires one or more associated proteins for activity. The associated protein may be a
reductase such as a co-enzyme-dependent cytochrome P450 reductase and/or may be
cytochrome b5 or a homologue thereof. The associated protein may be a naturally occurring
molecule in the cell into which the genetic material encoding the cytochrome P450 protein
is introduced or it may be introduced simultaneously with, subsequently to or separately
from the genetic material encoding the cytochrome P450 protein. In one embodiment, the
genetic material comprises a bicistronic construct encoding both the cytochrome P450
protein and the associated protein. In another embodiment, the genetic material comprises
two or more separate monocistronic constructs each encoding a cytochrome P450 protein
and one or more associated proteins. In still yet another embodiment, the genetic material
encodes a cytochrome P450 protein and its activity is facilitated by one or more host cell
associated proteins. In even yet another aspect, the genetic material is a tricistronic or
multicistronic construct encoding a cytochrome P450 protein and at least two associated
proteins.

Another aspect of the present invention provides for a use of genetic material encoding a
cytochrome P450 protein or a derivative or homologue thereof in the generation of a
genetically modified cell, which cell has the capacity to produce a pigment in the presence
of indole or a precursor, analogue or derivative thereof upon expression of said genetic
material wherein the genetic material, encoding the cytochrome P450 protein or its
functional derivative or homologue, is co-expressed with genetic material encoding an
associated protein.

More particularly, the present invention is directed to a use of genetic material encoding a
cytochrome P450 protein or a derivative or homologue thereof in the generation of a
transgenic plant which transgenic plant has the capacity to produce a pigment in all or part
of its tissue or non-cellular material in the presence of indole or a precursor, analogue or
derivative thereof upon expression of said genetic material wherein the genetic material
encoding the cytochrome P450 protein or its functional derivative or homologue is
co-expressed with genetic material encoding an associated protein. The cell may be a
microbial cell (e.g. a bacterial, yeast or fungal cell), an animal cell (e.g. an insect, fish,
amphibian, avian or mammalian cell) or a plant cell.

The associated protein may be indigenous to the cell or may have been previously
introduced. The present invention covers both homologous and heterologous cytochrome
P450 protein-associated protein combinations. A homologous combination is one which is
naturally occurring. That is, the associated protein is one which is normally associated with
the cytochrome P450 protein in a naturally occurring cell. A heterologous combination is
one where the associated protein is not normally associated with the cytochrome P450
enzyme. An associated protein includes a molecule such as cytochrome b5 or a functional
homologue or derivative thereof. The term "co-expressed" includes co-expression on a
single bicistronic, tricistronic or multicistronic construct, expression of separate
monocistronic constructs and expression of one or more genetic constructs together with
expression of a nucleotide sequence contained in genomic DNA.

The preferred associated protein is a protein which can catalyze the transfer of electrons.
One example of a reductase is a co-enzyme-dependent cytochrome P450 reductase. One
particularly useful reductase in the practice of the present invention is an
NADPH-cytochrome P450 reductase or "NPR" which catalyses the transfer of electrons from
NADPH via FAD and/or FMN to the prosthetic haem group on the cytochrome P450
molecule. A particularly useful NPR is a human-derived NPR which is designated herein
"Hnpr". Another useful reductase is a redoxin reductase. The preferred cytochrome P450
protein is a monooxygenase and is of mammalian or microbial origin. Particularly useful
cytochrome P450 monooxygenases in the practice of the present invention are of human or
primate origin. The present invention extends, however, to cytochrome P450
monooxygenases of bacterial or eukaryotic origin.

Another aspect of the present invention is directed to the use of genetic material encoding a
mammalian or microbial cytochrome P450 monooxygenase or a functional derivative or
homologue thereof in the generation of a genetically modified cell, which cell has the
capacity to produce a pigment in the presence of indole or a precursor, analogue or
derivative thereof and upon expression of said genetic material wherein said genetic
material encoding the cytochrome P450 monooxygenase or its functional derivative or
homologue is co-expressed with genetic material encoding an associated protein selected
from a co-enzyme-dependent cytochrome P450 reductase and/or a cytochrome b5 or a
functional derivative or homologue of either molecule.

More particularly, the present invention provides for the use of genetic material encoding a
mammalian or microbial cytochrome P450 monooxygenase or a functional derivative or
homologue thereof in the generation of a transgenic plant which transgenic plant has the
capacity to produce a pigment in all or part of its tissue or non-cellular material in the
presence of indole or a precursor, analogue or derivative thereof and upon expression of
said genetic material wherein said genetic material encoding the cytochrome P450
monooxygenase or its functional derivative or homologue is co-expressed with genetic
material encoding an associated protein selected from a co-enzyme-dependent cytochrome
P450 reductase and/or a cytochrome b5 or a functional derivative or homologue of either
molecule.

A preferred reductase is an NADPH-dependent cytochrome P450 reductase. 


As stated above, the term "co-expression" includes expression of genetic material 
encoding both the cytochrome P450 monooxygenase and the associated protein in a 
bicistronic, tricistronic or multicistronic construct as well as expression of separate genetic 
constructs or co-expression of the cytochrome P450 monooxygenase genetic material with 
genetic material encoding an associated protein indigenous to the cell into which the 
genetic material encoding the cytochrome P450 monooxygenase is introduced. 

The term "non-cellular material" in relation to plants includes vascular regions such as the 
phloem and xylem, fibrous material such as cotton and non-living regions of the plant. 

Although not intending to limit the present invention to any one theory or mode of action,
it is proposed that the cytochrome P450 protein and the associated protein(s) convert
indole to indoxyl. This compound may then spontaneously oxidize in the presence of
oxygen to oxidized indoxyl and then be spontaneously transformed to indigo. This process
may also be catalyzed by the cytochrome P450 protein. Alternatively, indoxyl may be
further transformed by cytochrome P450 catalysis to indigo by a different mechanism (see
Figure 11). Another product of cytochrome P450-dependent oxidation of indole is isatin.
Another pigment, indirubin, is also generated when isatin combines with indoxyl. The
pigments produced may remain in the cells, tissues or organs such as, in the case of plants,
cells of petals, flowers, stems, leaves or seeds, may leak to surrounding cellular or
non-cellular regions or may be transported to other regions via the plant vascular system. The
pigment may also be present in fibrous material such as cotton or even non-cellular
material. By expressing the genetic material encoding the cytochrome P450 protein and the
associated protein with a developmentally or organ specific promoter, particular parts of
the plant can be targeted for a colour change. The colour is generally dark, such as dark
blue or black, or may be a purple to red colour. Different shades of colours may be obtained
by providing chemically modified substrates for the cytochrome P450 monooxygenases or
providing to the plant modified substrates or compounds which before or after cytochrome
P450-dependent metabolism may complex with endogenous indole or a precursor,
analogue, derivative or metabolites thereof. In addition, a range of colours may be obtained
following interactions between indigo and other related dyes or their precursors and plant
molecules associated with pigmentation such as flavonoid molecules.
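
As an illustration only, the proposed conversions can be checked for atom balance. The sketch below is not part of the specification: the molecular formulas and the overall stoichiometries (two indoxyl plus oxygen giving indigo and water; indoxyl plus isatin giving indirubin and water) are standard textbook chemistry assumed here, and the helper names are hypothetical.

```python
import re
from collections import Counter

# Molecular formulas are standard chemistry facts, assumed here for illustration;
# they are not recited in the specification.
FORMULAS = {
    "indoxyl": "C8H7NO",
    "isatin": "C8H5NO2",
    "indigo": "C16H10N2O2",
    "indirubin": "C16H10N2O2",  # structural isomer of indigo
    "O2": "O2",
    "H2O": "H2O",
}


def atoms(formula, count=1):
    """Return element -> atom count for `count` molecules of `formula`."""
    total = Counter()
    for element, n in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total[element] += count * (int(n) if n else 1)
    return total


def side(terms):
    """Sum the atoms over one side of a reaction given as (count, name) pairs."""
    total = Counter()
    for count, name in terms:
        total += atoms(FORMULAS[name], count)
    return total


def is_balanced(reactants, products):
    """True when both sides of the reaction contain the same atoms."""
    return side(reactants) == side(products)


# Assumed overall stoichiometries for the two pigment-forming steps.
INDIGO_FORMATION = ([(2, "indoxyl"), (1, "O2")], [(1, "indigo"), (2, "H2O")])
INDIRUBIN_FORMATION = ([(1, "indoxyl"), (1, "isatin")], [(1, "indirubin"), (1, "H2O")])

if __name__ == "__main__":
    print(is_balanced(*INDIGO_FORMATION))
    print(is_balanced(*INDIRUBIN_FORMATION))
```

The check says nothing about mechanism or about whether a step is enzymatic or spontaneous; it only confirms that the assumed overall equations are consistent with the compounds named above.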

All plants are encompassed by the present invention, i.e. monocotyledonous plants and 
dicotyledonous plants. Particularly useful plants are flower crop plants including rose, 
carnation, petunia, lisianthus, lily, iris, tulip, freesia, delphinium, limonium, pelargonium,
commercial crops such as cotton and food crops (such as wheat, rice, barley and sugar 
cane). 

Yet another aspect of the present invention provides a method of inducing a colour change
in a plant or part of a plant, said method comprising introducing into plant cells a genetic
construct encoding a cytochrome P450 protein or a functional derivative or homologue
thereof, regenerating a plant from said plant cells and subjecting said plant to appropriate
conditions sufficient to permit expression of a nucleotide sequence on said genetic
construct which encodes said cytochrome P450 protein such that in the presence of indole
or a precursor, analogue or derivative thereof said cytochrome P450 protein facilitates the
conversion of indole or its precursor, analogue or derivative to an intermediate which is
capable of oxidation to a pigment.

Preferably, the cytochrome P450 protein is a monooxygenase of microbial or mammalian 
origin. The term "microbial origin" includes bacterial origin. Reference to a "cytochrome
P450 protein" includes and encompasses hybrid molecules from two or more cytochrome
P450 proteins as well as naturally occurring and artificially created mutants and 
derivatives. 

The activity of the cytochrome P450 protein requires the presence of one or more
accessory proteins which may be present in and be indigenous to the plant cells or may
also be introduced on a separate genetic construct or on the same genetic construct which
encodes the cytochrome P450 protein.

In one aspect, the indole (or its precursor) converted to indigo or its precursor is
endogenous indole. However, increased levels of indole or its precursor, analogue or
derivative thereof may be provided to the plant via, for example, its root system such as in
a fertilizer. Alternatively, cut flowers or other severed portions of the plant may be placed
in a solution of indole or precursor, analogue or derivative thereof. The vascular system, in
the case of a plant, will then transport the indole into the plant tissue cells and pigment
production will occur in the presence of the cytochrome P450 protein and the associated
protein. In a further aspect of the invention, cells are engineered to express genetic material
encoding a tryptophanase or another enzyme in order to promote and/or increase endogenous
levels of indole or its precursors or homologues. Preferably, the genetic material encoding the
tryptophanase or other enzymes would be able to be regulated at the developmental and/or
organ and/or tissue specific levels.

This aspect of the present invention, insofar as it relates to plants, further extends to 
progeny of the plants engineered to express the cytochrome P450 protein as well as 
vegetative, propagative and reproductive parts of the plants (including cuttings, pollen, 
seeds and callus).

Another aspect of the present invention provides a genetically modified cell or 
multicellular animal or plant or progeny thereof or parts of said transgenic plant or animal 
wherein said transgenic plant or animal comprises expressible genetic material encoding a 
microbial or mammalian cytochrome P450 protein.

More particularly, the present invention provides a transgenic plant or progeny thereof or 
parts of said transgenic plant wherein said transgenic plant comprises expressible genetic 
material encoding a microbial or mammalian cytochrome P450 protein. 


Preferably, the cytochrome P450 protein is a monooxygenase.

Even more particularly, the present invention provides a genetically modified cell or
multicellular animal or plant or progeny or parts thereof comprising expressible genetic
material encoding a microbial or mammalian cytochrome P450 monooxygenase such that
upon expression of said genetic material and in the presence of an associated protein and
indole or a precursor, analogue or derivative thereof, the indole or its precursor, analogue
or derivative is converted to an intermediate which is capable of oxidation to a pigment.

Still more particularly, the present invention is directed to a transgenic plant or progeny or 
parts thereof comprising expressible genetic material encoding a microbial or mammalian
cytochrome P450 monooxygenase such that upon expression of said genetic material and 
in the presence of an associated protein and indole or a precursor, analogue or derivative 
thereof, the indole or its precursor, analogue or derivative is converted to an intermediate 
which is capable of oxidation to a pigment. 


The term "genetically modified" is used in its broadest sense and includes introducing 
gene(s) into cells, mutating gene(s) in cells and altering or modulating the regulation of 
gene(s) in cells. 

A "part" of a plant includes flowers (e.g. cut or severed flowers), petals, stems, leaves and
fibrous material such as cotton and vegetative, propagative and reproductive material such 
as cuttings, pollen, seeds and callus. 

Generally, the coding region(s) on the genetic material are operably linked to a single or 
multiple promoters. In one embodiment, the genetic construct is a bicistronic construct
under the control of (i.e. operably linked to) a single promoter where the cistrons encode 
both the cytochrome P450 protein and the associated protein. In another embodiment, the 
genetic construct is a tricistronic or multicistronic construct. 






As used herein, the term "genetic material" refers to any single-stranded or
double-stranded nucleic acid molecule which at least comprises deoxyribonucleotides and/or
ribonucleotides, including DNA (cDNA or genomic DNA), RNA, mRNA, or tRNA,
amongst others. The combination of such molecules with non-nucleotide substituents
derived from synthetic means or naturally-occurring sources is also contemplated by the
present invention.


"Derivatives" of the genetic sequence of the invention refers to any isolated nucleic acid 
molecule which comprises at least 10 and preferably at least 20 contiguous nucleotides 
derived from the genetic sequence as described herein according to any embodiment. A 
derivative includes a part, fragment, portion or analogue. A derivative also includes a 
fusion molecule between two or more genetic sequences encoding cytochrome P450
molecules. 

An "analogue" of a genetic sequence of the invention means any isolated nucleic acid
molecule which is substantially the same as a nucleic acid molecule of the present
invention or its complementary nucleotide sequence as described herein according to any
embodiment, notwithstanding the occurrence of any non-nucleotide constituents not
normally present in said isolated nucleic acid molecule, for example carbohydrates,
radiochemicals including radionucleotides, reporter molecules such as, but not limited to,
alkaline phosphatase or horseradish peroxidase, amongst others. A "homologue" is a
functionally similar molecule from a different species or strain.

Generally, analogues or derivatives of the nucleic acid molecule of the invention are 
produced by synthetic means or alternatively, derived from naturally-occurring sources. 
For example, the nucleotide sequence of the present invention may be subjected to 
mutagenesis to produce single or multiple nucleotide substitutions, deletions and/or
insertions. 

The genetic sequence of the present invention may comprise a sequence of nucleotides or
be complementary to a sequence of nucleotides which comprise one or more of the
following: a promoter sequence, a 5' non-coding region, a cis-regulatory region such as a
functional binding site for transcriptional regulatory protein or translational regulatory
protein, an upstream activator sequence, an enhancer element, a silencer element, a TATA
box motif, a CCAAT box motif, or an upstream open reading frame, transcriptional start
site, translational start site, and/or nucleotide sequence which encodes a leader sequence.
The genetic sequence also encodes the cytochrome P450 enzyme and optionally an
associated protein (e.g. cytochrome P450 reductase and/or cytochrome b5).

The term "5' non-coding region" is used herein in its broadest context to include all 
nucleotide sequences which are derived from the upstream region of an expressible gene, 
other than those sequences which encode amino acid residues which comprise the 
polypeptide product of said gene, wherein the 5' non-coding region confers or activates or
otherwise facilitates, at least in part, expression of the gene. 

The term "gene" is used in its broadest context to include both a genomic DNA region 
corresponding to the gene as well as a cDNA sequence corresponding to exons or a 
recombinant molecule engineered to encode a functional form of a product.

As used herein, the term "cis-acting sequence" or "cis-regulatory region" or similar term
shall be taken to mean any sequence of nucleotides which is derived from an expressible
genetic sequence wherein the expression of the first genetic sequence is regulated, at least
in part, by said sequence of nucleotides. Those skilled in the art will be aware that a
cis-regulatory region may be capable of activating, silencing, enhancing, repressing or
otherwise altering the level of expression and/or cell-type-specificity and/or developmental
specificity of any structural gene sequence.

Reference herein to a "promoter" is to be taken in its broadest context and includes the
transcriptional regulatory sequences of a classical genomic gene, including the TATA box
which is required for accurate transcription initiation, with or without a CCAAT box
sequence and additional regulatory elements (i.e. upstream activating sequences, enhancers
and silencers) which alter gene expression in response to developmental and/or
environmental stimuli, or in a tissue-specific or cell-type-specific manner. A promoter is
usually, but not necessarily, positioned upstream or 5', of a structural gene, the expression
of which it regulates. Furthermore, the regulatory elements comprising a promoter are
usually positioned within 2kb of the start site of transcription of the gene.

In the present context, the term "promoter" is also used to describe a synthetic or fusion 
molecule, or derivative which confers, activates or enhances expression of a structural
gene or other nucleic acid molecule, in a plant cell. Preferred promoters according to the 
invention may contain additional copies of one or more specific regulatory elements to 
further enhance expression in a cell, and/or to alter the timing of expression of a structural 
gene to which it is operably connected. 


The term "operably connected" or "operably linked" in the present context means placing a
structural gene (e.g. cytochrome P450 protein or associated protein) under the regulatory
control of a promoter which then controls expression of the gene. Promoters and the like
are generally positioned 5' (upstream) to the genes which they control. In the construction
of heterologous promoter/structural gene combinations, it is generally preferred to position
the genetic sequence or promoter at a distance from the gene transcription start site that is
approximately the same as the distance between that genetic sequence or promoter and the
gene it controls in its natural setting, i.e., the gene from which the genetic sequence or
promoter is derived. As is known in the art, some variation in this distance can be
accommodated without loss of function. Similarly, the preferred positioning of a regulatory
sequence element with respect to a heterologous gene to be placed under its control is
defined by the positioning of the element in its natural setting, i.e., the genes from which it
is derived.

A further aspect of the present invention provides a transfected or transformed cell, tissue,
organ or non-cellular material which contains or is capable of producing cytochrome P450
protein or a functional derivative or homologue thereof. Preferably, the cytochrome P450
protein is a cytochrome P450 monooxygenase. This material will also contain or be
capable of producing an associated protein such as a co-enzyme-dependent cytochrome
P450 reductase or cytochrome b5 or a functional homologue or derivative of either.



WO 01/14565 



PCT/AU00/00966 




The genetic construct(s) of the present invention may be introduced into a cell by various 
techniques known to those skilled in the art. The technique used may vary depending on 
the known successful techniques for that particular organism. 

Techniques for introducing recombinant DNA into cells include, but are not limited to,
transformation using CaCl2 and variations thereof, direct DNA uptake into protoplasts,
PEG-mediated uptake to protoplasts, microparticle bombardment, electroporation,
microinjection of DNA, microparticle bombardment of tissue explants or cells,
vacuum-infiltration of tissue with nucleic acid, and T-DNA-mediated transfer from Agrobacterium
to the plant tissue.

For microparticle bombardment of cells, a microparticle is propelled into a cell to produce 
a transformed cell. Any suitable ballistic cell transformation methodology and apparatus 
can be used in performing the present invention. Exemplary apparatus and procedures are 
disclosed by Stomp et al (U.S. Patent No. 5,122,466) and Sanford and Wolf (U.S. Patent
No. 4,945,050). When using ballistic transformation procedures, the genetic construct may 
incorporate a plasmid capable of replicating in the cell to be transformed. 

Examples of microparticles suitable for use in such systems include 0.1 to 10 µm and more
particularly 0.5 to 5 µm tungsten or gold spheres. The DNA construct may be deposited
on the microparticle by any suitable technique, such as by precipitation. 

Once introduced into cells such as plant tissue, the expression of a cytochrome P450 
protein and/or associated protein may be assayed in a transient expression system or it may 
be determined after selection for stable integration within, for example, the plant genome.

Plant tissue capable of subsequent clonal propagation, whether by organogenesis or
embryogenesis, may be transformed with a genetic construct of the present invention and a
whole plant generated therefrom. The particular tissue chosen will vary depending on the
clonal propagation systems available for, and best suited to, the particular species being
transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons,
hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g. apical
meristem, axillary buds, and root meristems), and induced meristem tissue (e.g. cotyledon
meristem and hypocotyl meristem).

The regenerated transformed plants may be propagated by a variety of means, such as by
clonal propagation or classical breeding techniques. For example, a first generation (or T1)
transformed plant may be selfed to give a homozygous second generation (or T2)
transformant, and the T2 plants further propagated through classical breeding techniques.
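
By way of a hedged illustration, the expected outcome of selfing a first generation transformant can be enumerated. The sketch assumes a single-locus transgene insertion that is hemizygous in the first generation plant and segregates in a simple Mendelian fashion; neither assumption is stated in the text, and the function name and genotype notation ("T" for the transgene, "-" for its absence) are hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def self_cross(genotype):
    """Genotype frequencies among the progeny of a selfed diploid plant.

    `genotype` is a two-character string of alleles at one locus,
    e.g. "T-" for a plant hemizygous for a single transgene insertion.
    """
    # Each parent gamete carries one allele; pair every gamete with every other.
    progeny = Counter(
        "".join(sorted(pair, reverse=True)) for pair in product(genotype, repeat=2)
    )
    total = sum(progeny.values())
    return {g: Fraction(n, total) for g, n in progeny.items()}


if __name__ == "__main__":
    for g, freq in self_cross("T-").items():
        print(g, freq)
```

Under these assumptions, selfing a hemizygous plant gives one quarter homozygous, one half hemizygous and one quarter non-transgenic progeny, so homozygous second generation plants would need to be identified among the segregating population. Multiple or linked insertions would change these ratios.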

Intended recipient plants of the cytochrome P450 protein or its derivative or homologue
and/or associated protein or proteins (e.g. a cytochrome P450 reductase or cytochrome b5)
may first be modified such that the level of available indole or its precursor, analogue or
derivative thereof for conversion to pigments (e.g. indigo) is altered. Generally, but not
necessarily, the level is increased. A range of mutations in amino acid biosynthesis or
metabolic pathways may be induced to result in accumulation of indole or an indole
precursor. Alternatively, the indole or its precursor, analogue or derivative thereof is
provided exogenously such as via a fertilizer or via a solution applied to the plant or parts
of the plant.

The present invention is further directed to flowers (e.g. cut or severed flowers), parts of
plants, fibrous material from plants (e.g. cotton), non-cellular material and reproductive
portions of plants (e.g. cuttings, seeds, pollen, callus) which are coloured or have the
capacity to be coloured in the presence of indole or a precursor thereof due to the presence
of cytochrome P450 protein (e.g. monooxygenase) and an associated protein (e.g.
NADPH-dependent cytochrome P450 reductase or cytochrome b5).

Reference herein to "mammalian" in relation to a cytochrome P450 protein or associated
protein includes a human, primate, livestock animal (e.g. sheep, pig, cow, horse, donkey),
laboratory test animal (e.g. rabbit, mouse, guinea pig, hamster), companion animal (e.g.
cat, dog) or a captive wild animal. Preferred mammals are primates and humans.
Particularly preferred mammals are humans. Reference to "bacterial" includes prokaryotes
such as E. coli, Pseudomonas sp, Acinetobacter sp, and Citrobacter sp; the term
"microbial" includes bacteria but also includes eukaryotic cells such as yeast, fungi and
animal cells, e.g. insect cells.

Any and all microbial and mammalian cytochrome P450 proteins and in particular
cytochrome P450 monooxygenases are contemplated by the present invention provided
that such monooxygenases can act on indole or its precursors, analogues or derivatives.
Preferred cytochrome P450 proteins comprise the consensus amino acid sequence
(F/P/W/L/Y)(G/S/C/N/H/E)X(G/D/A/E)X(R/H/S/K/T)XCX1(G/A) wherein X is any
amino acid and X1 is selected from V, M, A, L, I, P, F or T and wherein residues in
parentheses represent alternatives at a single position. More particularly, the cytochrome
P450 monooxygenases comprise the consensus sequence
(F/L)(G/S)XGX(R/H)XCX1(G/A) where X is any amino acid and X1 is selected from V,
M, A, L, I, P, F or T and where residues in parentheses represent alternatives at a single
position. The cytochrome P450 monooxygenase may also comprise a hybrid from two or
more cytochrome P450 proteins. The cytochrome P450 monooxygenases according to this
aspect of the present invention are capable of converting indole or a precursor, analogue or
derivative to an intermediate which is capable of oxidation to a pigment. Conveniently, a
suitable cytochrome P450 monooxygenase having the above consensus sequence and which,
in the presence of an associated protein and indole or a precursor, analogue or derivative
thereof, provides a pigment or a precursor of a pigment, may be selected from Genbank.
Alternatively, a suitable monooxygenase may be selected from a Nelson website such as:
http://drnelson.utmem.edu/CytochromeP450.html;
http://drnelson.utmem.edu/p450apubl92.html; and
http://drnelson.utmem.edu/p450bpub237.html.

The consensus sequence is obtained by comparing the amino acid sequence of the haem 
binding region of P450 proteins. 
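
As a rough sketch of how a candidate sequence might be screened against the haem-binding consensus, the broader and narrower patterns recited above can be written as regular expressions. This is an illustration under stated assumptions, not a method described in the specification: the function and constant names are hypothetical, "any amino acid" is taken to mean the twenty standard one-letter codes, and the restricted position is the one limited to V, M, A, L, I, P, F or T.

```python
import re

# Assumption: "any amino acid" (X) means one of the 20 standard residues.
ANY = "[ACDEFGHIKLMNPQRSTVWY]"
# Restricted position: V, M, A, L, I, P, F or T.
RESTRICTED = "[VMALIPFT]"

# Broader consensus: (F/P/W/L/Y)(G/S/C/N/H/E)X(G/D/A/E)X(R/H/S/K/T)XC<restricted>(G/A)
BROAD = re.compile(
    f"[FPWLY][GSCNHE]{ANY}[GDAE]{ANY}[RHSKT]{ANY}C{RESTRICTED}[GA]"
)
# Narrower consensus: (F/L)(G/S)XGX(R/H)XC<restricted>(G/A)
NARROW = re.compile(f"[FL][GS]{ANY}G{ANY}[RH]{ANY}C{RESTRICTED}[GA]")


def find_motif(protein, pattern=BROAD):
    """Return (0-based start, matched residues) for the first hit, or None."""
    match = pattern.search(protein.upper())
    return (match.start(), match.group()) if match else None


if __name__ == "__main__":
    # Invented test peptide, not taken from any real P450 or from the sequence listing.
    peptide = "MKTFGAGRRICLGEEL"
    print(find_motif(peptide, NARROW))
    print(find_motif(peptide, BROAD))
```

A match indicates only that the ten-residue pattern is present; whether the protein acts on indole would still have to be determined experimentally, as the text requires.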

Particularly preferred cytochrome P450 monooxygenases include the human-derived P450
2A6, P450 2E1 and P450 2C19 monooxygenases. The nucleotide sequences encoding
these monooxygenases are given in <400>1, <400>3 and <400>5, respectively
(corresponding amino acid sequences are given in <400>2, <400>4 and <400>6,
respectively). These sequences are specifically designed for expression in bacterial cells.
The nucleotide sequences given in <400>7, <400>9 and <400>11 (with corresponding
amino acid sequences given in <400>8, <400>10 and <400>12, respectively) are
respectively for P450 2A6, P450 2E1 and P450 2C19 when in a form suitable for
expression in eukaryotic cells such as plant cells.

Accordingly, the present invention provides for the use of a genetic sequence comprising a
nucleotide sequence expressible in plant cells which encodes a cytochrome P450
monooxygenase in the generation of a transgenic plant or transformed cells of a plant
which, upon expression of said nucleotide sequence, and in the presence of indole or a
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the cytochrome P450 monooxygenase comprises the amino acid
sequence (F/P/W/L/Y)(G/S/C/N/H/E)X(G/D/A/E)X(R/H/S/K/T)XCX1(G/A) wherein X is
any amino acid and X1 is selected from V, M, A, L, I, P, F or T and wherein
residues in parentheses represent alternatives at a single position.

More particularly, the present invention provides for the use of a genetic sequence
comprising a nucleotide sequence expressible in plant cells which encodes a cytochrome
P450 monooxygenase in the generation of a transgenic plant or transformed cells of a plant
which, upon expression of said nucleotide sequence, and in the presence of indole or a
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the cytochrome P450 monooxygenase comprises the amino acid
sequence (F/L)(G/S)XGX(R/H)XCX1(G/A) where X is any amino acid and X1 is selected
from V, M, A, L, I, P, F or T and where residues in parentheses represent alternatives at a
single position.

Even more particularly, the present invention is directed to the use of a genetic sequence
comprising a nucleotide sequence expressible in plant cells which encodes a cytochrome
P450 monooxygenase in the generation of a transgenic plant or transformed cells of a plant
which, upon expression of said nucleotide sequence, and in the presence of indole or a
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the nucleotide sequence is selected from:

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>8,
<400>10 or <400>12;

(ii) a nucleotide sequence defined by <400>7, <400>9 or <400>11;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i)
or (ii);

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
encoding the amino acid sequence defined in <400>2, <400>4 or <400>6;

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence
defined in <400>1, <400>3 or <400>5; or

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v).


The cytochrome P450 monooxygenase of the present invention includes modified forms of 
the enzyme such as those resulting from site-directed or random mutagenesis as well as 
naturally occurring mutant forms. 

The cytochrome P450 monooxygenase requires an associated protein such as
NADPH-dependent cytochrome P450 reductase (NPR) or cytochrome b5. One particularly useful
associated protein is NPR. The nucleotide and corresponding amino acid sequences for an NPR
from human cells are given in <400>13 and <400>14 (for use in bacteria) and <400>15 and
<400>16 (for use in plants).

Accordingly, the present invention provides for the use of a genetic sequence comprising a
nucleotide sequence expressible in plant cells which encodes a cytochrome P450
monooxygenase in the generation of a transgenic plant or transformed cells of a plant
which, upon expression of said nucleotide sequence, and in the presence of indole or a
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a
pigment wherein the genetic sequence further comprises a nucleotide sequence encoding
an NPR wherein the nucleotide sequence is selected from:-

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>14 and 
15 <400>16; 

(ii) a nucleotide sequence defined by <400>13 and <400>15; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
20 or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
encoding the amino acid sequence defined in <400>14 and <400>16; 

25 (v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>13 and <400>15; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

30 
Another useful cytochrome P450 monooxygenase is of bacterial origin such as from 
Citrobacter sp. One particularly useful cytochrome P450 monooxygenase is from 
Citrobacter brakii. The nucleotide sequence and corresponding amino acid sequence are 
shown in <400>20 and <400>21, respectively. Its corresponding associated protein is a 
5 flavodoxin. The nucleotide sequence encoding flavodoxin and its corresponding amino 
acid sequence are shown in <400>22 and <400>23, respectively. 

Accordingly, the present invention provides for the use of a genetic sequence comprising a 
nucleotide sequence expressible in plant cells which encodes a cytochrome P450 
10 monooxygenase in the generation of a transgenic plant or transformed cells of a plant 
which, upon expression of said nucleotide sequence, and in the presence of indole or a 
precursor, analogue and/or derivative thereof, have the capacity to convert indole or a 
precursor, analogue or derivative thereof to an intermediate which is oxidizable to a 
pigment wherein the nucleotide sequence is selected from:- 

15 

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>21 ; 

(ii) a nucleotide sequence defined by <400>20; 



20 (iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
encoding the amino acid sequence defined in <400>21 ; 

25 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>20; or 



(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

In accordance with this embodiment, the bacterial cytochrome P450 monooxygenase is 
preferably operable in the presence of a flavodoxin or other suitably associated protein. 
Accordingly, another aspect of the present invention provides for the use of a genetic 
sequence comprising a nucleotide sequence expressible in plant cells which encodes a 
5 cytochrome P450 monooxygenase in the generation of a transgenic plant or transformed 
cells of a plant which, upon expression of said nucleotide sequence, and in the presence of 
indole or a precursor, analogue and/or derivative thereof, have the capacity to convert 
indole or a precursor, analogue or derivative thereof to an intermediate which is oxidizable 
to a pigment wherein the genetic sequence further comprises a nucleotide sequence 
10 encoding a bacterial flavodoxin wherein the nucleotide sequence is selected from:- 



(i) a nucleotide sequence encoding the amino acid sequence defined in <400>23; 

(ii) a nucleotide sequence defined by <400>22; 

15 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
20 encoding the amino acid sequence defined in <400>23; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>22; or 

25 (vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 



The term "similarity" as used herein includes exact identity between compared sequences 
at the nucleotide or amino acid level. Where there is non-identity at the nucleotide level, 
30 "similarity" includes differences between sequences which result in different amino acids 
that are nevertheless related to each other at the structural, functional, biochemical and/or 
conformational levels. Where there is non-identity at the amino acid level, "similarity" 
includes amino acids that are nevertheless related to each other at the structural, functional, 
biochemical and/or conformational levels. In a particularly preferred embodiment, 
nucleotide and sequence comparisons are made at the level of identity rather than 
5 similarity. 

Terms used to describe sequence relationships between two or more polynucleotides or 
polypeptides include "reference sequence", "comparison window", "sequence similarity", 
"sequence identity", "percentage of sequence similarity", "percentage of sequence 
10 identity", "substantially similar" and "substantial identity". A "reference sequence" is at 
least 12 but frequently 15 to 18 and often at least 25 or above, such as 30 monomer units, 
inclusive of nucleotides and amino acid residues, in length. Because two polynucleotides 
may each comprise (1) a sequence (i.e. only a portion of the complete polynucleotide 
sequence) that is similar between the two polynucleotides, and (2) a sequence that is 

15 divergent between the two polynucleotides, sequence comparisons between two (or more) 
polynucleotides are typically performed by comparing sequences of the two 
polynucleotides over a "comparison window" to identify and compare local regions of 
sequence similarity. A "comparison window" refers to a conceptual segment of typically 
12 contiguous residues that is compared to a reference sequence. The comparison window 

20 may comprise additions or deletions (i.e. gaps) of about 20% or less as compared to the 
reference sequence (which does not comprise additions or deletions) for optimal alignment 
of the two sequences. Optimal alignment of sequences for aligning a comparison window 
may be conducted by computerised implementations of algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics 

25 Computer Group, 575 Science Drive Madison, WI, USA) or by inspection and the best 
alignment (i.e. resulting in the highest percentage homology over the comparison window) 
generated by any of the various methods selected. Reference also may be made to the 
BLAST family of programs as for example disclosed by Altschul et al. (1997). A detailed 
discussion of sequence analysis can be found in Unit 19.3 of Ausubel et al. (1998). 

30 
The terms "sequence similarity" and "sequence identity" as used herein refer to the extent 
that sequences are identical or functionally or structurally similar on a nucleotide-by- 
nucleotide basis or an amino acid-by-amino acid basis over a window of comparison. 
Thus, a "percentage of sequence identity", for example, is calculated by comparing two 
5 optimally aligned sequences over the window of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g. A, T, C, G, I) or the identical amino 
acid residue (e.g. Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His, Asp, 
Glu, Asn, Gln, Cys and Met) occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of positions in the 

10 window of comparison (i.e., the window size), and multiplying the result by 100 to yield 
the percentage of sequence identity. For the purposes of the present invention, "sequence 
identity" will be understood to mean the "match percentage" calculated by the DNASIS 
computer program (Version 2.5 for Windows; available from Hitachi Software Engineering 
Co., Ltd., South San Francisco, California, USA) using standard defaults as used in the 

15 reference manual accompanying the software. Similar comments apply in relation to 
sequence similarity. 
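
The generic calculation described above (matched positions divided by window size, multiplied by 100) can be sketched as follows. This is only an illustration of that arithmetic: the function name and the use of '-' as a gap character are assumptions, and it is not the DNASIS "match percentage" routine, whose defaults are not reproduced here.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of window positions carrying the identical residue.

    Both inputs are assumed to be already optimally aligned and of equal
    length; '-' is an assumed gap symbol and never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("comparison window is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    # matched positions / window size * 100
    return 100.0 * matches / len(aligned_a)
```

Under this sketch a single mismatch in a ten-residue window gives 90% identity, and a gap in a four-residue window leaves at most 75%.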

Reference herein to a low stringency includes and encompasses from at least about 0 to at 
least about 15% v/v formamide and from at least about 1 M to at least about 2 M salt for 

20 hybridization, and at least about 1 M to at least about 2 M salt for washing conditions. 
Generally, low stringency is at from about 25-30°C to about 42°C. The temperature may 
be altered and higher temperatures used to replace formamide and/or to give alternative 
stringency conditions. Alternative stringency conditions may be applied where necessary, 
such as medium stringency, which includes and encompasses from at least about 16% v/v 

25 to at least about 30% v/v formamide and from at least about 0.5 M to at least about 0.9 M 
salt for hybridization, and at least about 0.5 M to at least about 0.9 M salt for washing 
conditions, or high stringency, which includes and encompasses from at least about 31% 
v/v to at least about 50% v/v formamide and from at least about 0.01 M to at least about 
0.15 M salt for hybridization, and at least about 0.01 M to at least about 0.15 M salt for 

30 washing conditions. In general, washing is carried out at Tm = 69.3 + 0.41 (G+C)% (Marmur 
and Doty, 1962). However, the Tm of a duplex DNA decreases by 1°C with every increase 
of 1% in the number of mismatch base pairs (Bonner and Laskey, 1974). Formamide is 
optional in these hybridization conditions. Accordingly, particularly preferred levels of 
stringency are defined as follows: low stringency is 6 x SSC buffer, 0.1% w/v SDS at 25- 
42°C; a moderate stringency is 2 x SSC buffer, 0.1% w/v SDS at a temperature in the 
5 range 20°C to 65°C; high stringency is 0.1 x SSC buffer, 0.1% w/v SDS at a temperature 
of at least 65°C. 
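
As a worked illustration of the Tm relationship quoted above, the following minimal sketch combines the Marmur and Doty expression with the 1°C per 1% mismatch correction. The function and parameter names are hypothetical, and the sketch ignores salt and formamide effects, which the text treats separately.

```python
def melting_temperature(gc_percent: float, mismatch_percent: float = 0.0) -> float:
    """Estimate duplex Tm in degrees C.

    Tm = 69.3 + 0.41 * (G+C)%, reduced by 1 degree C for every 1% of
    mismatched base pairs, as stated in the text.
    """
    if not 0.0 <= gc_percent <= 100.0:
        raise ValueError("gc_percent must lie between 0 and 100")
    if mismatch_percent < 0.0:
        raise ValueError("mismatch_percent must not be negative")
    perfect_duplex_tm = 69.3 + 0.41 * gc_percent
    return perfect_duplex_tm - 1.0 * mismatch_percent
```

For a duplex with 50% G+C this gives about 89.8°C, and about 79.8°C if 10% of the base pairs are mismatched.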

The present invention further contemplates the use of plant or plant material as a source of 
pigment for use, for example, in the dye or cosmetics industry, as commercial tags and as 

10 markers for transformation of microbial, plant and/or animal cells. The present invention 
further provides a genetic construct comprising a cistron encoding a microbial or 
mammalian cytochrome P450 protein for use, and more particularly, when used, in the 
generation of a transformed plant cell such that upon expression of said cistron, the 
cytochrome P450 facilitates conversion of indole or a precursor thereof to a molecule 

15 oxidizable to a pigment. Preferably, the cytochrome P450 protein is a mammalian or 
bacterial cytochrome P450 monooxygenase. 

Preferably, the cytochrome P450 monooxygenase comprises the amino acid sequence 
(F/P/W/I/Y)(G/S/C/N/H/E)X(G/D/A/E)X(R/H/S/K/T)XCX1(G/A) wherein X is any 
20 amino acid and X1 is selected from V, M, A, L, I, P, F or T and wherein residues in 
parentheses represent alternatives at a single position. 

More preferably, the cytochrome P450 monooxygenase comprises the amino acid 
sequence (F/L)(G/S)XGX(R/H)XCX1(G/A) where X is any amino acid and X1 is selected 
25 from V, M, A, L, I, P, F or T and where residues in parentheses represent alternatives at a 
single position. 
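
A consensus of this kind can be screened for with a regular expression. The sketch below encodes only the narrower motif, (F/L)(G/S)XGX(R/H)XCX1(G/A), using single-letter amino acid codes; the function name and the example peptide are hypothetical and serve purely as illustration.

```python
import re

# (F/L)(G/S) X G X (R/H) X C X1 (G/A), with X = any residue and
# X1 drawn from V, M, A, L, I, P, F or T.
HEME_MOTIF = re.compile(r"[FL][GS].G.[RH].C[VMALIPFT][GA]")


def find_motif(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched segment) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in HEME_MOTIF.finditer(protein.upper())]


if __name__ == "__main__":
    # Hypothetical peptide used only to exercise the pattern.
    print(find_motif("MKTAFSIGKRNCFGEGLA"))
```

A match only shows that a sequence carries the consensus; it says nothing about whether the protein has the claimed activity.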

Even more preferably, the nucleotide sequence encoding the cytochrome P450 
monooxygenase is selected from:- 

30 
(i) a nucleotide sequence encoding the amino acid sequence defined in <400>8, 
<400>10, <400>12 or <400>21; 

(ii) a nucleotide sequence defined by <400>7, <400>9, <400>11 or <400>20; 

5 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
10 encoding the amino acid sequence defined in <400>2, <400>4, <400>6 or 
<400>21; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>1, <400>3, <400>5 or <400>20; or 

15 

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 



Preferably, the genetic construct comprises another cistron encoding an associated protein 
20 such as but not limited to a co-enzyme-dependent cytochrome P450 reductase (NPR) 
and/or cytochrome b5 or a functional derivative or homologue thereof. 

A particularly useful associated protein is human NPR (hNPR) encoded by:- 

25 (i) a nucleotide sequence encoding the amino acid sequence defined in <400>16; 



(ii) a nucleotide sequence defined by <400>15; 



(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
encoding the amino acid sequence defined in <400>14; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
5 defined in <400>13; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

10 Another useful associated protein is a bacterial flavodoxin encoded by:- 

(i) a nucleotide sequence encoding the amino acid sequence defined in <400>23; 

(ii) a nucleotide sequence defined by <400>22; 

15 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide sequence in (i) 
or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
20 encoding the amino acid sequence defined in <400>23; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide sequence 
defined in <400>22; or 

25 (vi) a nucleotide sequence capable of hybridizing under low stringency conditions to a 
nucleotide sequence in one or more of (i), (ii), (iii), (iv) or (v). 

The above nucleotide sequences may encode the amino acid sequence of the naturally 
occurring molecule or a modified form such as resulting from site directed mutagenesis, 
30 random mutagenesis as well as naturally occurring mutant forms. 



WO 01/14565 PCT/AU00/00966 

Another aspect of the present invention provides an isolated nucleic acid molecule 
comprising a nucleotide sequence encoding or complementary to a sequence encoding a 
cytochrome P450 cin monooxygenase from Citrobacter brakii. 

5 Preferably, the cytochrome P450 monooxygenase comprises an amino acid sequence 
substantially as set forth in <400>21 or an amino acid sequence having at least about 50% 
similarity thereto after optimal alignment. 

Preferably, the cytochrome P450 monooxygenase is encoded by a nucleotide sequence 
10 substantially as set forth in <400>20 or a nucleotide sequence having at least about 50% 
similarity thereto after optimal alignment or a nucleotide sequence capable of hybridizing 
to <400>20 or its complementary form under low stringency conditions. 

Aspects of the subject invention were made with United States Government Support under 
15 grant number CA44353 awarded by the National Institutes of Health. The United States 
Government may have certain rights in aspects of the instant invention. A licence for 
foreign filing for various aspects of the present invention has been obtained under Serial 
Number P-101,005. 

20 The present invention is further described by the following non-limiting Examples. 

EXAMPLE 1 
Bacterial cultures 

Bicistronic plasmids for expression of P450s and NADPH-P450 reductase were 
5 constructed as previously described (Parikh et al., 1997). P450 2A6, P450 2C19, P450 
2D6, and the wild type and allelic variants of P450 2C9, were prepared using similar 
methods. P450 1B1 is described by Shimada et al., 1998. General schemes for the 
construction of plasmids for P450s 2A6, 2C19 and 2C9 are shown in Figures 1A, B and 
C, respectively. 

10 

Expression was done using TB media fortified with IPTG and δ-aminolevulinic acid 
(Gillam et al., 1993; Gillam et al., 1995). Bacterial membranes were prepared and 
incubated as described (Gillam et al., 1993; Parikh et al., 1997; Shimada et al., 1998; 
Gillam et al., 1997). 

15 

The nucleotide and corresponding amino acid sequences of P450 2A6, P450 2C19, P450 
2C9 and P450 cin are provided in the sequence listing (see Table 1 for a summary) 
together with the associated proteins, hNPR and flavodoxin. 

20 EXAMPLE 2 

Observation of blue cultures 

Blue pigment was seen in cultures co-expressing P450s with NPR. Initial TLC and visible 
spectroscopy with human, rat and mouse P450 2E1 systems yielded evidence for indigo 
25 production. The production of colour by P450 2A6 cultures was IPTG induction-dependent 
(Figure 2). 

EXAMPLE 3 

Isolation and characterization of pigments formed in bacterial cultures 

A 1 litre culture of P450 2A6 was grown for 48 hr at 29°C. Centrifugation (at 200 x g for 1 
5 min) yielded a dark blue pellet, which was collected and washed 5 times with H2O 
(suspension and recentrifugation each time). The material was suspended in 10 ml of N,N- 
dimethylformamide and subjected to repeated sonication with a microprobe. CHCl3 (100 
ml) was added and the solution was washed five times with an equal volume of H2O. The 
CHCl3 layer was dried with Na2SO4 and filtered through paper (A600 = 0.46, volume = 156 
10 ml). The material was concentrated at 50°C in vacuo to a low volume, which was streaked 
on a 1 mm x 20 cm x 20 cm silica gel G TLC plate. The plate was developed with CHCl3- 
CH3OH, 50:1 v/v (Figure 3). Individual coloured bands were excised and extracted first 
with CHCl3-CH3OH, 1:1 v/v, and then with acetone, followed by centrifugation and 
concentration under N2. 

15 

The extracellular pigment produced in a P450 2A6 culture was separated into four 
components, two blue and two pink/purple (Figure 3). Mass spectrometry yielded an 
apparent MH+ ion at m/z 263 for compounds 1, 2 and 4, indicating a molecular mass of 
262. The visible spectra of components 2 and 4 (Figure 4, λmax 603 nm and 552 nm, 
20 respectively) match the literature for indigo and indirubin, respectively (Laatsch and 
Ludwig-Kühn, 1986; Friedmann et al., 1950; Fearon and Boggust, 1950), which both have 
molecular weights of 262. A definitive MH+ for compound 3 was not apparent, and the 
visible spectra did not match those of compound 2 or 4. 
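
The agreement between the observed MH+ ion at m/z 263 and a molecular mass of 262 can be checked against the shared molecular formula of indigo and indirubin, C16H10N2O2 (the two are isomers). The sketch below is illustrative only: the isotope masses are standard monoisotopic values, and the helper names are hypothetical.

```python
# Monoisotopic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.00307401, "O": 15.99491462}
PROTON = 1.00727647  # mass added on protonation to give MH+

# Indigo and indirubin share the formula C16H10N2O2.
INDIGO = {"C": 16, "H": 10, "N": 2, "O": 2}


def monoisotopic_mass(formula: dict[str, int]) -> float:
    """Sum the isotope masses over the atom counts of an elemental formula."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


def mh_plus(formula: dict[str, int]) -> float:
    """m/z expected for the singly protonated molecule."""
    return monoisotopic_mass(formula) + PROTON


if __name__ == "__main__":
    print(f"M   = {monoisotopic_mass(INDIGO):.4f}")
    print(f"MH+ = {mh_plus(INDIGO):.4f}")
```

The computed values, about 262.07 for M and 263.08 for MH+, agree with the nominal figures reported for compounds 2 and 4.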

25 1H NMR of compounds 2 and 4 provided further support for the assignment of these as 
indigo and indirubin (Hart et al., 1992), respectively, particularly the 2-D spectra (Figures 
5 and 6). The symmetric indigo structure is obvious, while indirubin shows the expected 
asymmetry. 

30 UV-visible spectra were recorded in CHCl3 using a Cary 14/OLIS spectrophotometer (On- 
Line Instrument Systems, Bogart, GA). NMR spectra were recorded in d6-Me2SO at 298 K 
using Bruker AM-400 and AM-500 instruments (Bruker, Billerica, MA). Mass spectra 
were obtained in the positive ion mode using a Finnigan 7000 (Finnigan, Sunnyvale, CA) 
electrospray instrument, either by loop injection or by introduction following elution from 
a 2.1 x 150 mm Zorbax C8 column with a gradient of CH3OH increasing from 4.5 to 81% 
v/v over 32 min in 0.1% w/v aqueous HCO2H. 

EXAMPLE 4 
Assay of pigment production in vitro 

Bacterial membranes containing recombinant human P450 (0.2 µM, except P450 2D6 (0.05 
µM) and P450 1A1 (0.1 µM)) and NADPH-P450 reductase and human liver microsomes 
(0.2 µM P450) were prepared and incubated with indole (5 mM) under conditions 
supporting P450 catalysis (100 mM Tris-HCl, pH 7.4; 1 mM NADPH, 2.5 mM glucose-6- 
phosphate, 0.5 U/ml glucose-6-phosphate dehydrogenase). The P450s most efficient in the 
production of the blue pigment co-migrating with indigo on TLC were 2A6 > 2C19 ~ 2E1 
(Figure 7). However, lower amounts of other pigments were seen with most forms. 

In order to estimate the relative production of pigments by different recombinant P450 
forms expressed in bicistronic format with hNPR, bacterial membranes containing 
recombinant P450 enzymes and human NADPH-P450 reductase (0.2 µM P450) and 
human liver microsomes (0.04 µM P450), were incubated with 0.8 mM indole in the 
presence of an NADPH generating system as described above. Three preparations of 
bacterial membranes containing recombinant P450 2C19 and hNPR were tested 
corresponding to different levels of apparent pigment production during bacterial culture 
(1>2, 3). Bacterial membranes from cells transformed with the empty pCW vector or 
expressing recombinant hNPR alone were also included as minus-enzyme controls at 
protein concentrations equivalent to the mean of the protein concentrations in recombinant 
P450 incubations. Reactions were stopped at 0 (t=0 controls) or 30 mins by the addition of 
an equal volume of 1% w/v SDS and absorbance was measured at 665 nm (shown in 
preliminary studies to be the maximal wavelength for pigments produced by recombinant 
P450 2A6 when dissolved in the 0.5% w/v SDS milieu of quenched incubations). Figure 
8 shows that significant pigment production was seen with a range of recombinant P450 
preparations. 

EXAMPLE 5 

5 Further analysis of products of indole metabolism 

In order to characterize the full range of products formed from indole, bacterial membranes 
containing recombinant P450 enzymes and human NADPH-P450 reductase, and rat and 
human liver microsomes (0.2 µM P450), were incubated with 5 mM indole under 

10 conditions supporting P450 catalysis (100 mM Tris-HCl, pH 7.4; 1 mM NADPH, 2.5 mM 
glucose-6-phosphate, 0.5 U/ml glucose-6-phosphate dehydrogenase). Products formed 
were extracted with methyl-t-butyl ether, desiccated and resuspended in 15% v/v 
acetonitrile: 85% v/v 50 mM potassium phosphate, pH 7.4, then subjected to high 
performance liquid chromatography using a 150 x 3.9 mm Waters C8 Symmetry reverse 

15 phase column and eluted at 1 ml/min with the following gradient: 0-15 mins 15% v/v 
acetonitrile, 85% v/v 50 mM potassium phosphate, pH 7.4; 15-30 mins 15% to 30% v/v 
acetonitrile gradient; 30-35 mins 30% v/v to 15% v/v acetonitrile; 35 to 45 mins isocratic 
at 15% v/v acetonitrile. Under these conditions, isatin eluted at 7 mins and indole at 35 
mins. Other peaks, which were later identified by mass spectrometry as dioxindole and 

20 oxindole, eluted at 4.05 and 10.13 mins, respectively. Additional metabolite peaks eluting 
at times indicated in Table 3 below were not identified but relative yields were compared 
among incubations containing different P450 enzymes. 

EXAMPLE 6 

Characterization of the metabolic pathway: identification of products generated 
in vitro by recombinant bacterial membranes containing P450 2A6 
and hNPR and by human liver microsomes 

In order to define the primary and secondary metabolic pathways catalyzed by human 
P450 forms, indole (5 mM) was incubated with E. coli membranes containing P450 2A6 
and hNPR for varying time periods (5-30 min) and the products were separated by HPLC 
using slight modifications (lower CH3CN concentrations, larger size column - 10 x 250 
mm, C18 instead of C8) of the methods described in Example 5. UV spectra were measured 
on-line using a Thermo-Separations UV6000 rapid scanning spectrophotometer (Thermo- 
Separations, Piscataway, NJ, USA). Under these conditions, the substrate indole was 
eluted at 29.4 min (identified by co-chromatography with an external standard and mass 
spectrum, MH+ at m/z 118). Five of the product peaks eluted earlier and were identified by 
their spectral properties: 

Dioxindole: tR 7.5 min, UV λmax 210 nm, 254 nm, 295 nm and spectrum identical to that 
published by Ward (1923) and Cornforth et al. (1951). The product of Na2S2O4 reduction of 
isatin (Sumpter, 1945) yielded the same compound as judged by HPLC tR and UV 
spectrum. 

Isatin: tR 12.6 min, UV λmax 244 nm, 305 nm. tR and spectrum identical to commercial 
product (Aldrich) and products reported by Julian and Printy (1953) and Ward (1923). 
Mass spectrometry: m/z 148 (MH+). 

Indoxyl: tR 14 J min, UV λmax 239 nm. tR and UV spectra identical to material generated 
by reaction of commercial indoxyl-3-acetate and hog liver esterase (Sigma). Mass 
spectrometry: m/z 134 (MH+). 

Oxindole: tR 18.2 min, UV λmax 203 nm, 248 nm, both the same as commercial material 
(Cornforth et al., 1951; Barth et al., 1972). Mass spectrometry: m/z 134 (MH+). NMR 
(CDCl3) δ 3.56 (s, 2H, H-l), 6.88 (d, 1H, H-4 or H-7), 7.04 (t, 1H, H-5 or H-6), 7.24 (t, 1H, 
H-5 or H-6), 7.27 (d, 1H, H-4 or H-7), 7.53 (bs, 1H, NH). 

6-Hydroxyindole: tR 20.3 min, UV λmax 218 nm, 293 nm. Spectrum identical to literature 
5 (Armerego, 1971; Stoll et al., 1955). Mass spectrometry: m/z 134 (MH+). 

No peaks produced by recombinant P450 2A6 or human liver microsomes corresponded to 
4- or 5-hydroxyindole (commercial samples; Sigma), as judged by tR and UV spectra. 

10 The time course for production of the major products from indole is presented in Figure 9. 
At each indicated time point, a reaction aliquot was withdrawn and extracted into CH2Cl2 
to stop the reaction. Products (CH2Cl2 phase) were concentrated and analyzed by HPLC, 
with the amount of product expressed on the basis of A220 units. Indoxyl was detected only 
in the early portion of the experiment. The production of indigo, as estimated by the A615 
15 in the combined CH2Cl2 extract and a fraction prepared from the residual aqueous phase by 
adding two volumes of dimethylformamide, showed a lag but began within 2 min. The 
time courses of indoxyl and indigo appear to be related. 

Oxindole appears to be the major primary oxidation product of indole formed by P450 
20 2A6. The concentration of indole was varied in 2 min incubations with membranes 
containing P450 2A6 and hNPR. The parameters kcat = 6.9 (±0.6) min⁻¹ and Km = 122 (±30) 
µM were determined. 
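
To show what these parameters imply at a given indole concentration, the sketch below evaluates the Michaelis-Menten rate law with the reported values. It assumes simple hyperbolic kinetics, which is the usual basis for reporting kcat and Km but is not stated explicitly in the text; the function name is hypothetical.

```python
def turnover_rate(indole_um: float, kcat_per_min: float = 6.9, km_um: float = 122.0) -> float:
    """Michaelis-Menten turnover (min^-1 per P450) at a given indole concentration.

    Defaults are the values reported for P450 2A6 with hNPR; simple
    hyperbolic kinetics are an assumption of this sketch.
    """
    if indole_um < 0.0:
        raise ValueError("substrate concentration must not be negative")
    return kcat_per_min * indole_um / (km_um + indole_um)


if __name__ == "__main__":
    for conc in (122.0, 800.0, 5000.0):  # µM; 0.8 mM and 5 mM are used in the Examples
        print(f"{conc:7.0f} µM indole -> {turnover_rate(conc):.2f} min^-1")
```

On this model the 5 mM indole used in the incubations is close to saturating, giving a rate within a few percent of kcat.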

When isatin was used as the substrate in similar incubations, it was rapidly reduced to 
25 dioxindole in the P450 2A6 system (only product). Subsequent experiments with purified 
hNPR showed that this enzyme catalyzed the reduction efficiently itself and the P450 2A6 
did not seem to play a major role. The rate of reduction measured (with 1 mM isatin) was 
290 min⁻¹. 

30 Oxindole (1 mM) was stable when added to the typical P450 2A6/hNPR membrane 
system; no significant product formation was observed. 

These results support a scheme in which indole metabolism in P450 2A6/hNPR/NADPH 
systems and in NADPH-fortified human and rat liver microsomes follows the general 
features shown in Figure 10. 

EXAMPLE 7 
Bacterial P450 monooxygenase 

Similar experiments as those described above were conducted using a cytochrome P450 
monooxygenase from Citrobacter brakii, designated P450 cin. The nucleotide and 
corresponding amino acid sequences are shown in <400>20 and <400>21, respectively. 
Bicistronic constructs were prepared with the associated protein being flavodoxin 
(<400>22 and <400>23). Again, bacterial culture medium turned blue and data indicated 
the presence of indigo. 

EXAMPLE 8 

Characterization of the potential of P450 forms to catalyze indoxyl oxidation 

Whereas the preceding experiments had provided evidence that various P450 forms could 
catalyse the monooxygenation of indole to indoxyl and that this could lead to the formation 
of indigo and related compounds, it was of interest to determine whether multiple steps 
along the pathway leading to indigo could be catalysed by the cytochrome P450 system. In 
previous systems characterized in regard to indigo generation, the oxidation of indoxyl was 
considered to occur spontaneously in aerobic conditions. However, the experiments 
outlined above were undertaken in a reducing environment (e.g. 1 mM NADPH in the in 
vitro experiments), raising the possibility that spontaneous oxidation may not be the sole 
mechanism by which two indoxyl monomers were transformed into one molecule of 
indigo. The object of the following experiments was to determine whether the P450 
enzyme system could catalyse steps in the pathway leading to indigo subsequent to the 
initial oxidation of indole. 

Bacterial membranes containing 0.1 µM recombinant P450 2E1 and human NADPH- 
P450 reductase were incubated with an indoxyl generating system (0.5 mM indoxyl acetate 
and 50 Units porcine liver esterase [Sigma Chemical Company, Castle Hill, Australia]) 
under conditions supporting P450 catalysis (100 mM Tris-HCl, pH 7.4; 1 mM NADPH, 
5 2.5 mM glucose-6-phosphate, 0.5 U/ml glucose-6-phosphate dehydrogenase). Reactions 
were started by the addition of the NADPH generating system and esterase and incubated 
at 37°C in a thermostatically controlled cell compartment of a Beckman DU640 UV-Vis 
spectrophotometer. Indigo production was monitored by absorbance at 602 nm. 

10 Two different preparations of membranes containing recombinant P450 2E1 and hNPR 
were tested, differing in the ratio of P450 to reductase (1:1.5 in Preparation 1 and 1:0.95 in 
Preparation 2 in Table 4 below). Three different negative controls were used: incubations 
carried out in the absence of bacterial membranes; incubations carried out with bacterial 
membranes containing neither P450 nor hNPR, matched for protein concentration to the 
15 full incubations; and incubations containing bacterial membranes containing only hNPR 
(no P450), matched to the hNPR concentration in full incubations. Table 4 shows that 
incubations containing P450 2E1 showed elevated indigo production compared to all three 
controls. The mean rate of indigo production in the first negative control (no 
bacterial membranes) was constant between the two sets of assays and rates for the other 
20 negative controls were similar. Incubations containing Preparation 2 showed a markedly 
higher rate of catalysis of indigo formation compared to Preparation 1, a difference that 
may be due to the different P450 2E1:hNPR ratios in the two preparations. 

TABLE 4 

Indigo production in incubation with recombinant P450 2E1 using 
an indoxyl generating system 

Incubation                                           Rate of indigo production    Fold increase over 
                                                     (Absorbance units/min)a      highest control 

P450 2E1/hNPR - Preparation 1                        0.028 +/- 0.002              1.8 
No membranes                                         0.006 +/- 0.001 
Bacterial membranes containing no P450 2E1 or hNPR   0.013 +/- 0.002 
Bacterial membranes containing hNPR only             0.016 +/- 0.001 

P450 2E1/hNPR - Preparation 2                        0.040 +/- 0.007              3.1 
No membranes                                         0.006 +/- 0.001 
Bacterial membranes containing no P450 2E1 or hNPR   0.013 +/- 0.001 
Bacterial membranes containing hNPR only             0.013 +/- 0.001 

a Values represent the mean +/- standard deviation of three independent measurements. 
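
The final column of Table 4 appears to be the rate for each P450 2E1 preparation divided by the highest of its three negative-control rates. That reading is an assumption, checked here against the mean rates in the table; the function and variable names are hypothetical.

```python
def fold_over_highest_control(test_rate: float, control_rates: list[float]) -> float:
    """Ratio of a test incubation rate to the largest negative-control rate."""
    if not control_rates:
        raise ValueError("at least one control rate is required")
    return test_rate / max(control_rates)


# Mean rates (absorbance units/min) taken from Table 4.
# Controls: no membranes, membranes without P450 2E1 or hNPR, hNPR only.
prep1 = fold_over_highest_control(0.028, [0.006, 0.013, 0.016])
prep2 = fold_over_highest_control(0.040, [0.006, 0.013, 0.013])

if __name__ == "__main__":
    print(f"Preparation 1: {prep1:.2f}-fold")
    print(f"Preparation 2: {prep2:.2f}-fold")
```

The ratios come to about 1.75 and 3.08, consistent with the tabulated 1.8 and 3.1.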



These results support the scheme of indigo and indirubin production catalyzed by P450
2A6/hNPR/NADPH systems in NADPH-fortified human and rat liver microsomes as
shown in Figure 11.

EXAMPLE 9

Construction of vectors for transient expression of recombinant human 
cytochrome P450 and reductase in plant tissue 

Plant vectors containing the recombinant human cDNAs were prepared by subcloning the
cDNAs for cytochrome P450 forms and hNPR into the vectors pGTVa (containing the 35S
promoter) and p2.5GEM-ll (containing the AIM-1 promoter) using the strategies outlined
in Figure 12. Briefly, cDNA sequences for human cytochrome P450 forms and hNPR,
which had been modified at the 5' end of the coding sequences to enhance expression in
bacteria, were further modified to enhance expression in the eukaryotic (plant) system.
This was effected by PCR-based oligonucleotide-directed mutagenesis of a short segment
of the 5' coding sequence of the cDNA. Mutagenic primers were also designed to
incorporate a restriction site to enable subcloning into the cognate vector and a ribosomal
binding site sequence upstream of the translation start site (initial ATG codon). The
resultant PCR-amplified DNA fragments were digested with appropriate restriction
endonucleases and ligated into pGTVa or p2.5GEM-ll together with fragments encoding
the remainder of the respective human cDNAs.
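
The layout of such a mutagenic forward primer may be illustrated as follows. This is a sketch only: the specification does not disclose the primer sequences at this point, so the restriction site, ribosomal binding site and coding segment used in the example are hypothetical placeholders, and the function name is illustrative.

```python
VALID_BASES = set("ACGT")


def build_forward_primer(restriction_site, ribosome_binding_site, coding_start):
    """Concatenate the primer elements in 5'-to-3' order.

    Order follows the description: restriction site, then ribosomal binding
    site, then the modified 5' coding segment beginning at the initial ATG.
    """
    parts = (restriction_site, ribosome_binding_site, coding_start)
    for part in parts:
        if not part or set(part) - VALID_BASES:
            raise ValueError(f"not a plain DNA sequence: {part!r}")
    if not coding_start.startswith("ATG"):
        raise ValueError("coding segment must begin with the ATG start codon")
    return "".join(parts)


# Hypothetical placeholder elements, not sequences taken from the specification.
primer = build_forward_primer(
    restriction_site="GGATCC",        # placeholder cloning site
    ribosome_binding_site="AACA",     # placeholder upstream context
    coding_start="ATGGCTGCTTCTGGT",   # placeholder modified 5' coding segment
)
print(primer, "ATG begins at position", primer.index("ATG"))
```

Keeping the three elements as separate arguments makes it straightforward to substitute the site required by the cognate vector without touching the coding segment.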

Table 5 shows alignments of approximately the first thirty nucleotides encoding
cytochrome P450 2A6, P450 2C19 and P450 2E1 monooxygenases, which highlight the
differences required for expression in bacteria relative to the native eukaryotic sequence
for use in plant cells.

EXAMPLE 10

Bombardment of plant tissue with genetic material encoding a human P450 enzyme 
and/or human reductase to effect transient expression of a functional 
recombinant cytochrome P450 system 


The aim of these experiments is to transiently express a human P450 enzyme, with or
without human NADPH-P450 reductase (hNPR), in a white flower petal, namely petunia,
and to determine whether expression can be detected by the visible production of a blue
pigment, namely indigo, in plant tissue. A secondary aim is to determine whether indigo
production might be enhanced by supplementation with low concentrations of indole.

The gene bombardment protocol is initially optimised using the reporter vector pGTVa-GUS.
GUS expression is assayed using the method described by Jefferson et al. (1992).
Efficiency of the transformation is measured by the mean number of blue spots per petal
bombardment. The parameters examined during these initial optimisation experiments are
target distance, bombardment pressure and petal developmental stage.
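
The efficiency measure described above lends itself to a simple ranking of parameter combinations. The following sketch assumes that each combination of target distance, bombardment pressure and petal stage has been tested on several petals; the spot counts, the units (cm and kPa) and the stage labels are hypothetical and serve only to show the calculation.

```python
from statistics import mean


def rank_conditions(spot_counts):
    """Rank bombardment conditions by mean number of blue spots per petal.

    spot_counts maps (distance, pressure, petal_stage) to a list of blue-spot
    counts, one count per bombarded petal. Conditions with no counts are skipped.
    """
    means = {cond: mean(counts) for cond, counts in spot_counts.items() if counts}
    return sorted(means.items(), key=lambda item: item[1], reverse=True)


# Hypothetical counts for illustration only (distance in cm, pressure in kPa).
trial_counts = {
    (10, 800, "open flower"): [12, 15, 9],
    (15, 800, "open flower"): [22, 19, 25],
    (15, 1200, "bud"): [4, 7, 5],
}

ranking = rank_conditions(trial_counts)
best_condition, best_mean = ranking[0]
print("best condition:", best_condition, "mean spots per petal:", best_mean)
```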

Plasmid DNA is obtained from E. coli using a standard alkaline lysis procedure with and
without additional procedures for purification of the resultant DNA (Sambrook et al.,
1989). The DNA is prepared for bombardment by combining various amounts of tungsten
particle solution with DNA. After vortexing, the particles are precipitated with CaCl2 and
spermidine. After removing a portion of the supernatant, the tungsten suspension is
vortexed and an aliquot removed for bombardment.

In this experiment, white petunia flowers are used for bombardment. Petunia plants having
other colours may also be used. The device used for bombardment is the particle inflow
gun developed by Finer et al. (1992) which propels tungsten particles directly in a stream
of helium towards the target. The petal is placed in a petri dish containing filter paper
moistened with either sterile water or a solution of indole or osmotic medium (with or
without indole). Each petal preparation is bombarded with one of:-

(a) vector containing a P450 cDNA alone; or 

(b) vector containing a P450 cDNA plus vector containing the hNPR cDNA; or

(c) vector containing the GUS control cDNA. 

In some cases the vector containing the GUS control cDNA is bombarded simultaneously
with either or both types of vector containing human-derived sequences.

The petal distance and helium pressure are as found optimal in the initial optimisation
experiments. A negative control containing tungsten particles only is included for both the
water and indole petal samples.

The success of the bombardment is analyzed by the presence of blue spots after overnight 
incubation of the bombarded petal in either water or indole solution or osmotic medium 
(indicating in situ production of indigo). 
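
The set of petal treatments described in this Example can be enumerated as follows. This is a sketch under an explicit assumption: the text lists the vector options and the moistening media but does not state that every vector option is tested on every medium, so the full crossing shown here is illustrative rather than the protocol actually followed.

```python
from itertools import product

# Vector options (a) to (c) as listed in this Example.
CONSTRUCTS = [
    "P450 vector alone",
    "P450 vector plus hNPR vector",
    "GUS control vector",
]

# Media used to moisten the filter paper under the petal.
MEDIA = [
    "sterile water",
    "indole solution",
    "osmotic medium",
    "osmotic medium with indole",
]

# The tungsten-only negative control is described for water and indole samples.
NEGATIVE_CONTROL_MEDIA = ["sterile water", "indole solution"]


def bombardment_plan():
    """Return (treatment, medium) pairs, assuming a full construct x medium crossing."""
    plan = list(product(CONSTRUCTS, MEDIA))
    plan += [("tungsten particles only", medium) for medium in NEGATIVE_CONTROL_MEDIA]
    return plan


for treatment, medium in bombardment_plan():
    print(f"{treatment:32s} {medium}")
```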


EXAMPLE 11 

Introduction of genetic material encoding P450 monooxygenase to cells of
non-white plant

Different shades of colours are proposed to be obtainable using different coloured recipient
plants for the genetic material encoding the cytochrome P450 monooxygenase/associated
protein of the present invention. Accordingly, plants having coloured flowers such as red,
blue, yellow, orange, purple, pink, spotted and/or mottled are selected as recipients of the
subject genetic material. Various colour combinations are expected, which greatly enhance
the variety of colour phenotypes resulting from the use of the instant genetic constructs.

EXAMPLE 12 
Transformation procedures 

Yeast strains are transformed with genetic material according to Ito et al. (1990). Genetic
material is introduced into the Agrobacterium tumefaciens strain AGL0 by adding 5 µg of
plasmid DNA to 100 µl of competent AGL0 cells prepared by inoculating a 50 ml culture
of MG/L (Garfinkel and Nester, 1980), which is grown for 16 hours with
shaking at 28°C. The cells are then pelleted and resuspended in 0.5 ml of 85% v/v 100 mM
CaCl2/15% v/v glycerol. The DNA-Agrobacterium mixture is frozen by incubation in
liquid N2 for 2 minutes and then allowed to thaw by incubation at 37°C for 5 minutes. The
DNA/bacterial mix is then placed on ice for a further 10 minutes. The cells are then mixed
with 1 ml of LB (Sambrook et al., 1989) media and incubated with shaking for 16 hours at
28°C. Cells of A. tumefaciens carrying genetic material are selected on LB agar plates
containing 10 µg/ml gentamycin or other suitable selection such as another antibiotic or a
herbicide. The presence of genetic material is confirmed by Southern analysis of DNA
isolated from transformants resistant to gentamycin or to any other selective agent, such
as another antibiotic or a herbicide.

Petunia transformations 

(a) Plant Material

Leaf tissue from mature plants is treated in 1.25% w/v sodium hypochlorite for 2 minutes
and then rinsed three times in sterile water. The leaf tissue is then cut into 25 mm² squares
and precultured on MS media (Murashige and Skoog, 1962) supplemented with 0.05 mg/l
kinetin and 1.0 mg/l 2,4-dichlorophenoxyacetic acid (2,4-D) for 24 hr.

(b) Co-cultivation of Agrobacterium and Petunia Tissue

Agrobacterium tumefaciens strain AGL0 containing genetic material is maintained at 4°C
on MG/L agar plates with 100 mg/L gentamycin. A single colony is grown overnight in
liquid medium containing 1% w/v Bacto-peptone, 0.5% w/v Bacto-yeast extract and 1%
w/v NaCl. A final concentration of 5 x 10⁸ cells/ml is prepared the next day by dilution in
liquid MS medium containing B5 vitamins (Gamborg et al., 1968) and 3% w/v sucrose
(BPM). The leaf discs are dipped for 2 minutes into BPM containing AGL0/genetic
material. The leaf discs are then blotted dry and placed on co-cultivation media for 4 days.
The co-cultivation medium consists of SH medium (Schenk and Hildebrandt, 1972)
supplemented with 0.05 mg/l kinetin and 1.0 mg/l 2,4-D and includes a feeder layer of
tobacco cell suspension spread over the co-cultivation medium with a filter paper placed
on top of the tobacco cell suspension.
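
The dilution of an overnight culture to 5 x 10⁸ cells/ml follows the usual relation C1 x V1 = C2 x V2. The following is a minimal sketch of that arithmetic; the density of the overnight culture and the final volume are hypothetical values, since the specification does not state them.

```python
TARGET_CELLS_PER_ML = 5e8  # inoculum density stated in the text


def dilution_volumes(stock_cells_per_ml, final_volume_ml, target_cells_per_ml=TARGET_CELLS_PER_ML):
    """Return (culture_ml, diluent_ml) needed to reach the target density.

    Uses C1 * V1 = C2 * V2, where C1 is the overnight culture density.
    """
    if stock_cells_per_ml < target_cells_per_ml:
        raise ValueError("overnight culture is already below the target density")
    culture_ml = target_cells_per_ml * final_volume_ml / stock_cells_per_ml
    return culture_ml, final_volume_ml - culture_ml


# Hypothetical overnight culture at 2 x 10^9 cells/ml diluted to a 20 ml inoculum.
culture_ml, diluent_ml = dilution_volumes(stock_cells_per_ml=2e9, final_volume_ml=20.0)
print(f"{culture_ml:.1f} ml culture + {diluent_ml:.1f} ml diluent")
```

The same calculation applies wherever a culture is diluted to this density in the later Examples, with the diluent changed to the medium named in each case.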

(c) Recovery of transgenic plants 


After co-cultivation, the leaf discs are transferred to a selection medium (MS medium
supplemented with 3% w/v sucrose, 2 mg/l α-benzylaminopurine (BAP), 0.5 mg/l
α-naphthalene acetic acid (NAA), 300 mg/l kanamycin, 350 mg/L cefotaxime and 0.3% w/v
Gelrite Gellan Gum (Schweizerhall)). Regenerating explants are transferred to fresh
selection medium after 4 weeks. Adventitious shoots which survive the kanamycin
selection are isolated and transferred to BPM containing 100 mg/l kanamycin and 200 mg/l
cefotaxime for root induction. All cultures are maintained under a 16 hour photoperiod (60
µmol m⁻² s⁻¹ cool white fluorescent light) at 23 ± 2°C. When roots reach 2-3 cm in length
the transgenic petunia plantlets are transferred to autoclaved Debco 51410/2 potting mix in
8 cm tubes. After 4 weeks, plants are replanted into 15 cm pots, using the same potting
mix, and maintained at 23°C under a 14 hour photoperiod (300 µmol m⁻² s⁻¹ mercury
halide light).

EXAMPLE 13 

Transformation of Dianthus caryophyllus

(a) Plant material 

Dianthus caryophyllus (cv. Crowley Sim, Red Sim, Laguna) cuttings are used in this
experiment. The outer leaves are removed and the cuttings are sterilized briefly in 70% v/v
ethanol followed by 1.25% w/v sodium hypochlorite (with Tween 20) for 6 minutes and
rinsed three times with sterile water. All the visible leaves and axillary buds are removed
under the dissecting microscope before co-cultivation.

(b) Co-cultivation of Agrobacterium and Dianthus Tissue

Agrobacterium tumefaciens strain AGL0 containing a genetic construct encoding a
cytochrome P450 monooxygenase and optionally an associated protein as herein described
is maintained at 4°C on MG/L (Garfinkel and Nester, 1980) agar plates with 100 mg/l
gentamycin. A single colony is grown overnight in liquid MG/L broth and diluted to
5 x 10⁸ cells/ml the next day before inoculation. Dianthus tissue is co-cultivated with
Agrobacterium on MS medium (Murashige and Skoog, 1962) supplemented with 3% w/v
sucrose, 5 mg/l α-naphthalene acetic acid (NAA), 20 µM acetosyringone and 0.8% w/v
Difco Bacto Agar (pH 5.7).

(c) Recovery of Transgenic Dianthus Plants 

Co-cultivated tissue is transferred to MS medium supplemented with 1 mg/l
benzylaminopurine (BAP), 0.1 mg/l NAA, 150 mg/l kanamycin, 500 mg/l ticarcillin and
0.8% w/v Difco Bacto Agar (selection medium). After three weeks, explants are
transferred to fresh selection medium and care is taken at this stage to remove axillary
shoots from stem explants. After 6-8 weeks on selection medium, healthy adventitious
shoots are transferred to hormone free MS medium containing 3% w/v sucrose, 150 mg/l
kanamycin, 500 mg/l ticarcillin and 0.8% w/v Difco Bacto Agar. At this stage, GUS
histochemical assay (Jefferson, 1987) and/or NPT II dot-blot assay (McDonnell et al.,
1987) are used to identify transgenic shoots. Transgenic shoots are transferred to MS
medium supplemented with 3% w/v sucrose, 500 mg/l ticarcillin and 0.4% w/v Gelrite
Gellan Gum (Schweizerhall) for root induction. All cultures are maintained under a 16
hour photoperiod (120 µE cool white fluorescent light) at 23 ± 2°C. When plants are
rooted and have reached 4-6 cm tall they are acclimatised under mist. A mix containing a high
ratio of perlite (75% or greater) soaked in hydroponic mix (Kandreck and Black, 1984) is
used for acclimation, which typically lasts 4-5 weeks. Plants are acclimatised at 23°C
under a 14 hour photoperiod (200 µE mercury halide light).

EXAMPLE 14
Transformation of Rosa hybrida 

1. Rosa hybrida cv Royalty 


Plant tissues of the rose cultivar Royalty are transformed according to the method
disclosed in PCT/AU91/04412, having publication number WO92/00371.

2. Rosa hybrida cv Kardinal 

(a) Plant Material

Kardinal shoots are used. Leaves are removed and the remaining shoots (5-6 cm) are
sterilized in 1.25% w/v sodium hypochlorite (with Tween 20) for 5 minutes followed by
three rinses with sterile water. Isolated shoot tips are soaked in sterile water for 1 hr and
precultured for 2 days on MS medium containing 3% w/v sucrose, 0.1 mg/L BAP, 0.1 mg/l
kinetin, 0.2 mg/l Gibberellic acid, 0.5% w/v polyvinyl pyrrolidone and 0.25% w/v Gelrite
Gellan Gum, before co-cultivation.

(b) Co-cultivation of Agrobacterium and Rosa shoot Tissue 


Agrobacterium tumefaciens strains ICMP 8317 (Janssen and Gardner, 1989) and AGL0,
containing genetic constructs encoding a cytochrome P450 monooxygenase and optionally
an NPR, are maintained at 4°C on MG/L agar plates with 100 mg/L gentamycin. A single
colony from each Agrobacterium strain is grown overnight in liquid MG/L broth. A final
concentration of 5 x 10⁸ cells/ml is prepared the next day by dilution in liquid MG/L.
Before inoculation, the two Agrobacterium cultures are mixed in a ratio of 10:1. A
longitudinal cut is made through the shoot tip and an aliquot of 2 µl of the mixed
Agrobacterium cultures is placed as a drop on the shoot tip. The shoot tips are
co-cultivated for 5 days on the same medium used for preculture.
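
The volumes needed for the 10:1 mixture and the number of 2 µl aliquots it provides can be worked out as below. This is a sketch only: the specification does not say which of the two strains contributes the larger share, and the total volume used in the example is a hypothetical figure.

```python
ALIQUOT_UL = 2.0  # volume placed on each shoot tip, as stated in the text


def mix_volumes(total_ul, ratio=(10, 1)):
    """Split a total volume between two cultures according to the given ratio."""
    major, minor = ratio
    if major <= 0 or minor <= 0:
        raise ValueError("ratio parts must be positive")
    unit = total_ul / (major + minor)
    return major * unit, minor * unit


def shoot_tips_covered(total_ul, aliquot_ul=ALIQUOT_UL):
    """Number of whole aliquots, and hence shoot tips, a mixture can supply."""
    return int(total_ul // aliquot_ul)


# Hypothetical 110 µl mixture; which strain is the major component is an assumption.
major_ul, minor_ul = mix_volumes(110.0)
print(f"{major_ul:.0f} µl of the major culture + {minor_ul:.0f} µl of the minor culture")
print("shoot tips that can be inoculated:", shoot_tips_covered(110.0))
```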


Agrobacterium tumefaciens strain AGL0 is maintained at 4°C on MG/L agar plates with 
100 mg/L kanamycin. A single colony from each Agrobacterium strain is grown overnight 
in liquid MG/L broth. A final concentration of 5 x 10⁸ cells/ml is prepared the next day by
dilution in liquid MG/L. 

(c) Recovery of Transgenic Rosa Plants 


After co-cultivation, the shoot tips are transferred to selection medium. Shoot tips are
transferred to fresh selection medium every 3-4 weeks. Galls observed on the shoot tips are
excised when they reach 6-8 mm in diameter. Isolated galls are transferred to MS
medium containing 3% w/v sucrose, 25 mg/l kanamycin, 250 mg/l cefotaxime and 0.25%
w/v Gelrite Gellan Gum for shoot formation. Shoots regenerated from gall tissue are
isolated and transferred to selection medium. GUS histochemical assay and callus assay
are used to identify transgenic shoots. Transgenic shoots are transferred to MS medium
containing 3% w/v sucrose, 200 mg/l cefotaxime and 0.25% w/v Gelrite Gellan Gum for
root induction. All cultures are maintained under a 16 hour photoperiod (60 µE cool white
fluorescent light) at 23 ± 2°C. When the root system is well developed and the shoot
has reached 5-7 cm in length the transgenic rose plants are transferred to autoclaved Debco
51410/2 potting mix in 8 cm tubes. After 2-3 weeks plants are replanted into 15 cm pots
using the same potting mix and maintained at 23°C under a 14 hour photoperiod (300 µE
mercury halide light). After 1-2 weeks potted plants are moved to a glasshouse (Day/Night
temperature: 25-28°C/14°C) and grown to flowering.

EXAMPLE 15 
Transformation of Chrysanthemum morifolium 

(a) Plant Material

Chrysanthemum morifolium (cv. Blue Ridge, Pennine Chorus) cuttings are obtained.
Leaves are removed from the cuttings, which are then sterilized briefly in 70% v/v
ethanol followed by 1.25% w/v sodium hypochlorite (with Tween 20) for 3 minutes and
rinsed three times with sterile water. Internodal stem sections are used for co-cultivation.

(b) Co-cultivation of Agrobacterium and Chrysanthemum Tissue

Agrobacterium tumefaciens strain LBA4404 (Hoekema et al., 1983), containing genetic
material, is grown on MG/L agar plates containing 50 mg/l rifampicin and 10 mg/l
gentamycin. A single colony from each Agrobacterium is grown overnight in the same
liquid medium. These liquid cultures are made 10% v/v with glycerol and 1 ml aliquots
transferred to the freezer (-80°C). A 100-200 µl aliquot of each frozen Agrobacterium is
grown overnight in liquid MG/L containing 50 mg/l rifampicin and 10 mg/l gentamycin. A
final concentration of 5 x 10⁸ cells/ml is prepared the next day by dilution in liquid MS
containing 3% w/v sucrose. Stem sections are co-cultivated with Agrobacterium in
co-cultivation medium for 4 days.

(c) Recovery of Transgenic Chrysanthemum Plants 

After co-cultivation, the stem sections are transferred to selection medium. After 3-4
weeks, regenerating explants are transferred to fresh medium. Adventitious shoots which
survive the kanamycin selection are isolated and transferred to MS medium containing
kanamycin and cefotaxime for shoot elongation and root induction. All cultures are
maintained under a 16 hour photoperiod (80 µE cool white fluorescent light) at 23 ± 2°C.
Leaf samples are collected from plants which rooted on kanamycin and Southern blot
analysis is used to identify transgenic plants. When transgenic chrysanthemum plants reach
4-5 cm in length, they are transferred to autoclaved Debco 51410/2 potting mix in 8 cm
tubes. After 2 weeks, plants are replanted into 15 cm pots using the same potting mix and
maintained at 23°C under a 14 hour photoperiod (300 µE mercury halide light). After 2
weeks potted plants are moved to a glasshouse (Day/Night temperature: 25-28°C/14°C) and
grown to flowering.

EXAMPLE 16 
Transgenic plant phenotype analysis 

The analysis of transgenic plant phenotypes in relation to changes in flower or plant tissue
colour is dependent on the recipient plants. For example, where genetic material is
introduced into plants having white flowers, plant flower colour is determined using colour
codes taken from the Royal Horticultural Society's Colour Chart (RHSCC). This provides
one means by which to describe the colour phenotypes observed. The determination of
designated numbers, however, should be taken only as a guide to the perceived colours and
should not be regarded as limiting the possible colours which may be obtained.


Where a recipient plant comprises non-white flowers, a colour change may be assessed 
inter alia with respect to colour per se, uniformity of colour, colour distribution as well as 
chemically such as by analysing possible combinations of indigo or its derivatives or 
precursors and endogenous pigment molecules (e.g. flavonoid compounds). 


Those skilled in the art will appreciate that the invention described herein is susceptible to
variations and modifications other than those specifically described. It is to be understood
that the invention includes all such variations and modifications. The invention also
includes all of the steps, features, compositions and compounds referred to or indicated in
this specification, individually or collectively, and any and all combinations of any two or
more of said steps or features.

CLAIMS

1. Use of genetic material encoding a cytochrome P450 protein or a functional 
derivative or homologue thereof in the generation of a genetically modified cell, which cell 
has the capacity to produce a pigment in the presence of indole or a precursor, analogue or 
derivative thereof upon expression of said genetic material. 

2. Use according to Claim 1 wherein the genetic material, encoding the cytochrome 
P450 protein or its functional derivative or homologue, is co-expressed with genetic 
material encoding an associated protein. 

3. Use according to Claim 2 wherein the genetic material encoding an associated 
protein is introduced into the cell. 

4. Use according to Claim 2 wherein the genetic material encoding an associated 
protein is indigenous to the cell. 

5. Use according to Claim 2 or 3 or 4 wherein the associated protein is selected from
a co-enzyme-dependent cytochrome P450 reductase and/or a cytochrome b5 and/or a
flavodoxin or a functional derivative or homologue of either molecule.

6. Use according to Claim 1 wherein the cytochrome P450 protein is of 
mammalian origin. 

7. Use according to Claim 1 wherein the cytochrome P450 protein is of microbial 
origin. 

8. Use according to Claim 6 wherein the cytochrome P450 protein is a co-enzyme- 
dependent cytochrome P450 monooxygenase. 

9. Use according to Claim 8 wherein the cytochrome P450 protein is P450 2A6 
encoded by a nucleotide sequence substantially as set forth in <400>17 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide
sequence capable of hybridizing to <400>17 or its complementary form under low 
stringency conditions. 

10. Use according to Claim 8 wherein the cytochrome P450 protein is P450 2E1 
encoded by a nucleotide sequence substantially as set forth in <400>18 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>18 or its complementary form under low 
stringency conditions. 

11. Use according to Claim 8 wherein the cytochrome P450 protein is P450 2C19 
encoded by a nucleotide sequence substantially as set forth in <400>19 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>19 or its complementary form under low 
stringency conditions. 

12. Use according to Claim 8 wherein the cytochrome P450 protein is P450 cin 
encoded by a nucleotide sequence substantially as set forth in <400>20 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>20 or its complementary form under low 
stringency conditions. 

13. Use according to any one of the preceding claims wherein the genetically 
modified cell is of plant origin. 

14. Use according to any one of Claims 1 to 12 wherein the genetically modified 
cell is of microbial origin. 

15. A method of inducing a colour change in a plant or part of a plant, said method 
comprising introducing into plant cells a genetic construct encoding a cytochrome P450 
protein or a functional derivative or homologue thereof, regenerating a plant from said 
plant cells and subjecting said plant to appropriate conditions sufficient to permit 
expression of a nucleotide sequence on said genetic construct which encodes said
cytochrome P450 protein such that in the presence of indole or a precursor, analogue or 
derivative thereof said cytochrome P450 protein facilitates the conversion of indole or its 
precursor, analogue or derivative to an intermediate which is capable of oxidation to a 
pigment. 

16. A method according to Claim 15 wherein the genetic construct encoding the 
cytochrome P450 protein is co-expressed with genetic material encoding an associated 
protein. 

17. A method according to Claim 16 wherein the associated protein is selected from 
a co-enzyme-dependent cytochrome P450 reductase and/or a cytochrome b5 or a functional
derivative or homologue of either molecule. 

18. A method according to Claim 15 wherein the cytochrome P450 protein is of 
mammalian origin. 

19. A method according to Claim 15 wherein the cytochrome P450 protein is of 
microbial origin. 

20. A method according to Claim 15 wherein the cytochrome P450 protein is a co- 
enzyme-dependent cytochrome P450 monooxygenase. 

21. A method according to Claim 20 wherein the cytochrome P450 protein is P450 
2A6 encoded by a nucleotide sequence substantially as set forth in <400>17 or a 
nucleotide sequence having at least 50% similarity thereto after optimal alignment or a 
nucleotide sequence capable of hybridizing to <400>17 or its complementary form under 
low stringency conditions. 

22. A method according to Claim 20 wherein the cytochrome P450 protein is P450 
2E1 encoded by a nucleotide sequence substantially as set forth in <400>18 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>18 or its complementary form under low
stringency conditions. 

23. A method according to Claim 20 wherein the cytochrome P450 protein is P450 
2C19 encoded by a nucleotide sequence substantially as set forth in <400>19 or a 
nucleotide sequence having at least 50% similarity thereto after optimal alignment or a 
nucleotide sequence capable of hybridizing to <400>19 or its complementary form under 
low stringency conditions. 

24. A method according to Claim 20 wherein the cytochrome P450 protein is P450 
cin encoded by a nucleotide sequence substantially as set forth in <400>20 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>20 or its complementary form under low 
stringency conditions. 

25. A genetically modified cell or multicellular animal or plant or progeny thereof or 
parts of said transgenic plant or animal wherein said transgenic plant or animal comprises 
expressible genetic material encoding a microbial or mammalian cytochrome P450 protein. 

26. A genetically modified cell or multicellular animal or plant or progeny or parts 
thereof according to Claim 25 wherein upon expression of said genetic material and in the 
presence of an associated protein and indole or a precursor, analogue or derivative thereof, 
the indole or its precursor, analogue or derivative is converted to an intermediate which is 
capable of oxidation to a pigment. 

27. A genetically modified cell according to Claim 25 or 26 wherein the cell is a 
microbial cell. 

28. A transgenic plant or progeny or parts thereof comprising expressible genetic 
material encoding a microbial or mammalian cytochrome P450 monooxygenase such that 
upon expression of said genetic material and in the presence of an associated protein and 
indole or a precursor, analogue or derivative thereof, the indole or its precursor, analogue 
or derivative is converted to an intermediate which is capable of oxidation to a pigment.

29. A transfected or transformed cell, tissue, organ or non-cellular material which 
contains or is capable of producing cytochrome P450 protein or a functional derivative or 
homologue thereof. 

30. A transfected or transformed cell, tissue, organ or non-cellular material 
according to Claim 29 wherein the cytochrome P450 protein is P450 2A6 encoded by a 
nucleotide sequence substantially as set forth in <400>17 or a nucleotide sequence having 
at least 50% similarity thereto after optimal alignment or a nucleotide sequence capable of 
hybridizing to <400>17 or its complementary form under low stringency conditions. 

31. A transfected or transformed cell, tissue, organ or non-cellular material 
according to Claim 29 wherein the cytochrome P450 protein is P450 2E1 encoded by a 
nucleotide sequence substantially as set forth in <400>18 or a nucleotide sequence having 
at least 50% similarity thereto after optimal alignment or a nucleotide sequence capable of 
hybridizing to <400>18 or its complementary form under low stringency conditions. 

32. A transfected or transformed cell, tissue, organ or non-cellular material 
according to Claim 29 wherein the cytochrome P450 protein is P450 2C19 encoded by a 
nucleotide sequence substantially as set forth in <400>19 or a nucleotide sequence having 
at least 50% similarity thereto after optimal alignment or a nucleotide sequence capable of 
hybridizing to <400>19 or its complementary form under low stringency conditions. 

33. A transfected or transformed cell, tissue, organ or non-cellular material 
according to Claim 29 wherein the cytochrome P450 protein is P450 cin encoded by a 
nucleotide sequence substantially as set forth in <400>20 or a nucleotide sequence having 
at least 50% similarity thereto after optimal alignment or a nucleotide sequence capable of 
hybridizing to <400>20 or its complementary form under low stringency conditions. 

34. Flowers, parts of plants, fibrous material from plants, non-cellular material and 
reproductive portions of plants which are coloured or have the capacity to be coloured in 
the presence of indole or a precursor thereof due to the presence of cytochrome P450
protein and an associated protein. 

35. Flowers, parts of plants, fibrous material from plants, non-cellular material and 
reproductive portions of plants according to Claim 34 wherein the colour produced in the 
presence of indole requires an introduced or indigenous associated protein. 

36. Use of a genetic sequence comprising a nucleotide sequence expressible in plant 
cells which encodes a cytochrome P450 monooxygenase in the generation of a transgenic 
plant or transformed cells of a plant which, upon expression of said nucleotide sequence, 
and in the presence of indole or a precursor, analogue and/or derivative thereof, have the 
capacity to convert indole or a precursor, analogue or derivative thereof to an intermediate 
which is oxidizable to a pigment wherein the cytochrome P450 monooxygenase comprises 
the amino acid sequence (F/PAV/I7Y)(G/S/C/N/H/E)X 
(G/D/A/E)X(R/H/S/K/T)XCXi(G/A) wherein X is any amino acid and Xi is selected from 
V, M, A, L, I, P, F or T and wherein residues in parentheses represent alternatives at
a single position. 

37. Use of a genetic sequence comprising a nucleotide sequence expressible in plant 
cells which encodes a cytochrome P450 monooxygenase in the generation of a transgenic 
plant or transformed cells of a plant which, upon expression of said nucleotide sequence, 
and in the presence of indole or a precursor, analogue and/or derivative thereof, have the 
capacity to convert indole or a precursor, analogue or derivative thereof to an intermediate 
which is oxidizable to a pigment wherein the cytochrome P450 monooxygenase comprises 
the amino acid sequence (F/L)(G/S)XGX(R/H)XCXi(G/A) where X is any amino acid and 
Xi is selected from V, M, A, L, I, P, F or T and where residues in parentheses represent 
alternatives at a single position. 
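
The motif notation used in Claim 37 can be translated mechanically into a regular expression for screening candidate amino acid sequences. The following is a minimal sketch, assuming single-letter amino acid codes; the function name and the test peptides are illustrative and are not sequences taken from the sequence listing.

```python
import re

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
XI_RESIDUES = "VMALIPFT"  # residues permitted at position Xi, as recited in the claim


def motif_to_regex(motif, xi_residues=XI_RESIDUES):
    """Convert notation such as (F/L)(G/S)XGX(R/H)XCXi(G/A) to a compiled regex."""
    parts = []
    i = 0
    while i < len(motif):
        if motif[i] == "(":
            # Parenthesised alternatives at a single position.
            close = motif.index(")", i)
            parts.append("[" + motif[i + 1:close].replace("/", "") + "]")
            i = close + 1
        elif motif.startswith("Xi", i):
            # Restricted position Xi.
            parts.append("[" + xi_residues + "]")
            i += 2
        elif motif[i] == "X":
            # X is any amino acid.
            parts.append("[" + AMINO_ACIDS + "]")
            i += 1
        else:
            # Fixed residue.
            parts.append(motif[i])
            i += 1
    return re.compile("".join(parts))


CLAIM_37_MOTIF = motif_to_regex("(F/L)(G/S)XGX(R/H)XCXi(G/A)")

# Hypothetical peptides for illustration only.
hit = CLAIM_37_MOTIF.search("MKFGAGKRICVGLL")
miss = CLAIM_37_MOTIF.search("FGAGKRICWG")  # W is not permitted at Xi
print(hit.group(0) if hit else None, miss)
```

The restricted position is parsed before the general X so that "Xi" is not read as an unrestricted X followed by a literal residue.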



38. Use of a genetic sequence comprising a nucleotide sequence expressible in plant 
cells which encodes a cytochrome P450 monooxygenase in the generation of a transgenic 
plant or transformed cells of a plant which, upon expression of said nucleotide sequence, 
and in the presence of indole or a precursor, analogue and/or derivative thereof, have the 
capacity to convert indole or a precursor, analogue or derivative thereof to an intermediate
which is oxidizable to a pigment wherein the nucleotide sequence is selected from:- 

(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>8, <400>10 or <400>12; 

(ii) a nucleotide sequence defined by <400>7, <400>9 or <400>11;

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>2, 
<400>4 or <400>6; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>1, <400>3 or <400>5; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v).

39. Use of a genetic sequence comprising a nucleotide sequence expressible in plant 
cells which encodes a cytochrome P450 monooxygenase in the generation of a transgenic 
plant or transformed cells of a plant which, upon expression of said nucleotide sequence, 
and in the presence of indole or a precursor, analogue and/or derivative thereof, have the 
capacity to convert indole or a precursor, analogue or derivative thereof to an intermediate 
which is oxidizable to a pigment wherein the genetic sequence further comprises a 
nucleotide sequence encoding an NPR wherein the nucleotide sequence is selected from:- 



(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>14 and <400>16; 

(ii) a nucleotide sequence defined by <400>13 and <400>15; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>14 and 
<400>16; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>13 and <400>15; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v). 

40. Use of a genetic sequence comprising a nucleotide sequence expressible in plant 
cells which encodes a cytochrome P450 monooxygenase in the generation of a transgenic 
plant or transformed cells of a plant which, upon expression of said nucleotide sequence, 
and in the presence of indole or a precursor, analogue and/or derivative thereof, have the 
capacity to convert indole or a precursor, analogue or derivative thereof to an intermediate 
which is oxidizable to a pigment wherein the nucleotide sequence is selected from: 

(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>21; 

(ii) a nucleotide sequence defined by <400>20; 



(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>21; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>20; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v). 

41. Use of a genetic sequence comprising a nucleotide sequence expressible in plant 
cells which encodes a cytochrome P450 monooxygenase in the generation of a transgenic 
plant or transformed cells of a plant which, upon expression of said nucleotide sequence, 
and in the presence of indole or a precursor, analogue and/or derivative thereof, have the 
capacity to convert indole or a precursor, analogue or derivative thereof to an intermediate 
which is oxidizable to a pigment wherein the genetic sequence further comprises a 
nucleotide sequence encoding a bacterial flavodoxin wherein the nucleotide sequence is 
selected from: 

(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>23; 

(ii) a nucleotide sequence defined by <400>22; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>23; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>22; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v). 

42. Use of plant or plant material as a source of pigment for use, for example, in the 
dye or cosmetics industry or as commercial tags or as markers for transformation of 
microbial, plant or animal cells. 

43. A genetic construct comprising a cistron encoding a microbial or mammalian 
cytochrome P450 protein for use in the generation of a transformed plant cell such that 
upon expression of said cistron, the cytochrome P450 facilitates conversion of indole or a 
precursor thereof to a molecule oxidizable to a pigment. 

44. A genetic construct according to Claim 43 wherein the cytochrome P450 
monooxygenase comprises the amino acid sequence (F/P/W/IVY)(G/S/C/N/H/E)X 
(G/D/A/E)X(R/H/S/K/T)XCXi(G/A) wherein X is any amino acid and Xi is selected from 
V, M, A, L, I, P, F or T and wherein residues in parentheses represent alternatives at 
a single position. 

45. A genetic construct according to Claim 43 wherein the cytochrome P450 
monooxygenase comprises the amino acid sequence (F/L)(G/S)XGX(R/H)XCXi(G/A) 
where X is any amino acid and Xi is selected from V, M, A, L, I, P, F or T and where 
residues in parentheses represent alternatives at a single position. 
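
The motif recited in Claim 45 can be read as a pattern over one-letter amino acid codes. The following Python sketch is illustrative only and forms no part of the claims; the function name `find_motif` is hypothetical. The test fragment is residues 429 to 445 of <400>2 written in one-letter code.

```python
import re

# Claim 45 motif: (F/L)(G/S)XGX(R/H)XCXi(G/A), in one-letter amino acid codes.
# X  = any amino acid (any capital letter is accepted here for simplicity).
# Xi = one of V, M, A, L, I, P, F or T.
P450_MOTIF = re.compile(r"[FL][GS][A-Z]G[A-Z][RH][A-Z]C[VMALIPFT][GA]")


def find_motif(protein):
    """Return (0-based start, matched text) for every motif occurrence."""
    return [(m.start(), m.group()) for m in P450_MOTIF.finditer(protein.upper())]


# Residues 429-445 of <400>2: Phe Val Pro Phe Ser Ile Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala
fragment = "FVPFSIGKRNCFGEGLA"
print(find_motif(fragment))  # [(3, 'FSIGKRNCFG')]
```

In this fragment the conserved cysteine is Cys 439 and Xi is Phe 440.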

46. A genetic construct according to Claim 43 or 44 or 45 comprising a nucleotide 
sequence selected from:- 

(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>8, <400>10, <400>12 or <400>21; 

(ii) a nucleotide sequence defined by <400>7, <400>9, <400>11 or 
<400>20; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>2, 
<400>4, <400>6 or <400>21; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>1, <400>3, <400>5 or <400>20; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v). 

47. A genetic construct according to Claim 43 or 44 or 45 or 46 further comprising a 
nucleotide sequence encoding an associated protein human NPR (hNPR) wherein the 
nucleotide sequence is selected from:- 

(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>16; 

(ii) a nucleotide sequence defined by <400>15; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>14; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>13; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v). 

48. A genetic construct according to Claim 43 or 44 or 45 or 46 further comprising a 
nucleotide sequence encoding bacterial flavodoxin wherein the nucleotide sequence is 
selected from:- 

(i) a nucleotide sequence encoding the amino acid sequence defined in 
<400>23; 

(ii) a nucleotide sequence defined by <400>22; 

(iii) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence in (i) or (ii); 

(iv) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence encoding the amino acid sequence defined in <400>23; 

(v) a nucleotide sequence having at least 50% similarity to a nucleotide 
sequence defined in <400>22; or 

(vi) a nucleotide sequence capable of hybridizing under low stringency 
conditions to a nucleotide sequence in one or more of (i), (ii), (iii), (iv) 
or (v). 



49. An isolated nucleic acid molecule comprising a nucleotide sequence encoding or 
complementary to a sequence encoding a cytochrome P450 cin monooxygenase from 
Citrobacter braakii. 

50. An isolated nucleic acid molecule according to Claim 49 wherein the 
cytochrome P450 monooxygenase comprises an amino acid sequence substantially as set 
forth in <400>21 or an amino acid sequence having at least about 50% similarity thereto 
after optimal alignment. 

51. An isolated nucleic acid molecule according to Claim 50 comprising a 
nucleotide sequence substantially as set forth in <400>20 or a nucleotide sequence having 
at least about 50% similarity thereto after optimal alignment or a nucleotide sequence 
capable of hybridizing to <400>20 or its complementary form under low stringency 
conditions. 
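
For orientation, a percentage figure "after optimal alignment" can be computed as sketched below. This is a minimal illustration under stated assumptions, not the method of the specification: it uses a Needleman-Wunsch global alignment with an assumed scoring scheme (match +1, mismatch -1, gap -1) and reports percent identity over the alignment length, whereas "similarity" as used in the claims may be defined differently. The function names are hypothetical.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of the alignment length."""
    x, y = global_align(a, b)
    if not x:
        return 0.0
    same = sum(1 for p, q in zip(x, y) if p == q and p != "-")
    return 100.0 * same / len(x)


print(global_align("ACGT", "ACT"))      # ('ACGT', 'AC-T')
print(percent_identity("ACGT", "ACT"))  # 75.0
```

A candidate sequence would meet a 50% threshold under these assumptions when `percent_identity` returns 50.0 or more against the reference sequence.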

52. A method of inducing a colour change in or by a microbial cell, said method 
comprising introducing into microbial cells a genetic construct encoding a cytochrome 
P450 protein or a functional derivative or homologue thereof, and subjecting said cells to 
appropriate conditions sufficient to permit expression of a nucleotide sequence on said 
genetic construct which encodes said cytochrome P450 protein such that in the presence of 
indole or a precursor, analogue or derivative thereof said cytochrome P450 protein 
facilitates the conversion of indole or its precursor, analogue or derivative to an 
intermediate which is capable of oxidation to a pigment. 

53. A method according to Claim 52 wherein the genetic construct encoding the 
cytochrome P450 protein is co-expressed with genetic material encoding an associated 
protein. 

54. A method according to Claim 53 wherein the associated protein is selected from 
a co-enzyme-dependent cytochrome P450 reductase and/or a cytochrome b5 and/or a 
flavodoxin or a functional derivative or homologue of either molecule. 

55. A method according to Claim 52 wherein the cytochrome P450 protein is of 
mammalian origin. 

56. A method according to Claim 52 wherein the cytochrome P450 protein is of 
microbial origin. 

57. A method according to Claim 55 wherein the cytochrome P450 protein is a co- 
enzyme-dependent cytochrome P450 monooxygenase. 

58. A method according to Claim 57 wherein the cytochrome P450 protein is P450 
2A6 encoded by a nucleotide sequence substantially as set forth in <400>17 or a 
nucleotide sequence having at least 50% similarity thereto after optimal alignment or a 
nucleotide sequence capable of hybridizing to <400>17 or its complementary form under 
low stringency conditions. 

59. A method according to Claim 57 wherein the cytochrome P450 protein is P450 
2E1 encoded by a nucleotide sequence substantially as set forth in <400>18 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>18 or its complementary form under low 
stringency conditions. 

60. A method according to Claim 57 wherein the cytochrome P450 protein is P450 
2C19 encoded by a nucleotide sequence substantially as set forth in <400>19 or a 
nucleotide sequence having at least 50% similarity thereto after optimal alignment or a 
nucleotide sequence capable of hybridizing to <400>19 or its complementary form under 
low stringency conditions. 

61. A method according to Claim 57 wherein the cytochrome P450 protein is P450 
cin encoded by a nucleotide sequence substantially as set forth in <400>20 or a nucleotide 
sequence having at least 50% similarity thereto after optimal alignment or a nucleotide 
sequence capable of hybridizing to <400>20 or its complementary form under low 
stringency conditions. 

SEQUENCE LISTING 



<110> The University of Queensland, Vanderbilt University 



<120> Pigment production 

<130> 2302839/EJH 

<140> International 

<141> 2000-08-11 



<150> PQ2156 
<151> 1999-08-11 

<160> 22 



<170> PatentIn Ver. 2.1 



<210> 1 

<211> 1485 

<212> DNA 

<213> Human cells 



<220> 

<221> CDS 

<222> (1)..(1482) 

<400> 1 

atg gct gct tca gga atg tta tta gtg gcc ttg ctg gtc tgc ctg act 48 
Met Ala Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
1 5 10 15 

gtg atg gtc ttg atg tct gtt tgg cag cag agg aag agc aag ggg aag 96 
Val Met Val Leu Met Ser Val Trp Gln Gln Arg Lys Ser Lys Gly Lys 
20 25 30 

ctg cct ccg gga ccc acc cca ttg ccc ttc att gga aac tac ctg cag 144 
Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe Ile Gly Asn Tyr Leu Gln 
35 40 45 

ctg aac aca gag cag atg tac aac tcc ctc atg aag atc agt gag cgc 192 
Leu Asn Thr Glu Gln Met Tyr Asn Ser Leu Met Lys Ile Ser Glu Arg 
50 55 60 

tat ggc ccc gtg ttc acc att cac ttg ggg ccc cgg cgg gtc gtg gtg 240 
Tyr Gly Pro Val Phe Thr Ile His Leu Gly Pro Arg Arg Val Val Val 
65 70 75 80 

ctg tgt gga cat gat gcc gtc agg gag gct ctg gtg gac cag gct gag 288 
Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gln Ala Glu 
85 90 95 

gag ttc agc ggg cga ggc gag caa gcc acc ttc gac tgg gtc ttc aaa 336 
Glu Phe Ser Gly Arg Gly Glu Gln Ala Thr Phe Asp Trp Val Phe Lys 
100 105 110 

ggc tat ggc gtg gta ttc agc aac ggg gag cgc gcc aag cag ctc cgg 384 
Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gln Leu Arg 
115 120 125 

cgc ttc tcc atc gcc acc ctg cgg gac ttc ggg gtg ggc aag cga ggc 432 
Arg Phe Ser Ile Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly 
130 135 140 

atc gag gag cgc atc cag gag gag gcg ggc ttc ctc atc gac gcc ctc 480 
Ile Glu Glu Arg Ile Gln Glu Glu Ala Gly Phe Leu Ile Asp Ala Leu 
145 150 155 160 

cgg ggc act ggc ggc gcc aat atc gat ccc acc ttc ttc ctg agc cgc 528 
Arg Gly Thr Gly Gly Ala Asn Ile Asp Pro Thr Phe Phe Leu Ser Arg 
165 170 175 

aca gtc tcc aat gtc atc agc tcc att gtc ttt ggg gac cgc ttt gac 576 
Thr Val Ser Asn Val Ile Ser Ser Ile Val Phe Gly Asp Arg Phe Asp 
180 185 190 

tat aag gac aaa gag ttc ctg tca ctg ttg cgc atg atg cta gga atc 624 
Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly Ile 
195 200 205 

ttc cag ttc acg tca acc tcc acg ggg cag ctc tat gag atg ttc tct 672 
Phe Gln Phe Thr Ser Thr Ser Thr Gly Gln Leu Tyr Glu Met Phe Ser 
210 215 220 

tcg gtg atg aaa cac ctg cca gga cca cag caa cag gcc ttt cag ttg 720 
Ser Val Met Lys His Leu Pro Gly Pro Gln Gln Gln Ala Phe Gln Leu 
225 230 235 240 

ctg caa ggg ctg gag gac ttc ata gcc aag aag gtg gag cac aac cag 768 
Leu Gln Gly Leu Glu Asp Phe Ile Ala Lys Lys Val Glu His Asn Gln 
245 250 255 

cgc acg ctg gat ccc aat tcc cca cgg gac ttc att gac tcc ttt ctc 816 
Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe Ile Asp Ser Phe Leu 
260 265 270 

atc cgc atg cag gag gag gag aag aac ccc aac acg gag ttc tac ttg 864 
Ile Arg Met Gln Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu 
275 280 285 

aaa aac ctg gtg atg acc acg ttg aac ctc ttc att ggg ggc acc gag 912 
Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe Ile Gly Gly Thr Glu 
290 295 300 

acc gtc agc acc acc ctg cgc tat ggc ttc ttg ctg ctc atg aag cac 960 
Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His 
305 310 315 320 

cca gag gtg gag gcc aag gtc cat gag gag att gac aga gtg atc ggc 1008 
Pro Glu Val Glu Ala Lys Val His Glu Glu Ile Asp Arg Val Ile Gly 
325 330 335 

aag aac cgg cag ccc aag ttt gag gac cgg gcc aag atg ccc tac atg 1056 
Lys Asn Arg Gln Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met 
340 345 350 

gag gca gtg atc cac gag atc caa aga ttt gga gac gtg atc ccc atg 1104 
Glu Ala Val Ile His Glu Ile Gln Arg Phe Gly Asp Val Ile Pro Met 
355 360 365 

agt ttg gcc cgc aga gtc aaa aag gac acc aag ttt cgg gat ttc ttc 1152 
Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arg Asp Phe Phe 
370 375 380 

ctc cct aag ggc acc gaa gtg tac cct atg ctg ggc tct gtg ctg aga 1200 
Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg 
385 390 395 400 

gac ccc agt ttc ttc tcc aac ccc cag gac ttc aat ccc cag cac ttc 1248 
Asp Pro Ser Phe Phe Ser Asn Pro Gln Asp Phe Asn Pro Gln His Phe 
405 410 415 

ctg aat gag aag ggg cag ttt aag aag agt gat gct ttt gtg ccc ttt 1296 
Leu Asn Glu Lys Gly Gln Phe Lys Lys Ser Asp Ala Phe Val Pro Phe 
420 425 430 

tcc atc gga aag cgg aac tgt ttc gga gaa ggc ctg gcc aga atg gag 1344 
Ser Ile Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu 
435 440 445 

ctc ttt ctc ttc ttc acc acc gtc atg cag aac ttc cgc ctc aag tcc 1392 
Leu Phe Leu Phe Phe Thr Thr Val Met Gln Asn Phe Arg Leu Lys Ser 
450 455 460 

tcc cag tca cct aag gac att gac gtg tcc ccc aaa cac gtg ggc ttt 1440 
Ser Gln Ser Pro Lys Asp Ile Asp Val Ser Pro Lys His Val Gly Phe 
465 470 475 480 

gcc acg atc cca cga aac tac acc atg agc ttc ctg ccc cgc tga 1485 
Ala Thr Ile Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg 
485 490 



<210> 2 
<211> 494 
<212> PRT 
<213> Human cells 

<400> 2 

Met Ala Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
1 5 10 15 

Val Met Val Leu Met Ser Val Trp Gln Gln Arg Lys Ser Lys Gly Lys 
20 25 30 

Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe Ile Gly Asn Tyr Leu Gln 
35 40 45 

Leu Asn Thr Glu Gln Met Tyr Asn Ser Leu Met Lys Ile Ser Glu Arg 
50 55 60 

Tyr Gly Pro Val Phe Thr Ile His Leu Gly Pro Arg Arg Val Val Val 
65 70 75 80 

Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gln Ala Glu 
85 90 95 

Glu Phe Ser Gly Arg Gly Glu Gln Ala Thr Phe Asp Trp Val Phe Lys 
100 105 110 

Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gln Leu Arg 
115 120 125 

Arg Phe Ser Ile Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly 
130 135 140 

Ile Glu Glu Arg Ile Gln Glu Glu Ala Gly Phe Leu Ile Asp Ala Leu 
145 150 155 160 

Arg Gly Thr Gly Gly Ala Asn Ile Asp Pro Thr Phe Phe Leu Ser Arg 
165 170 175 

Thr Val Ser Asn Val Ile Ser Ser Ile Val Phe Gly Asp Arg Phe Asp 
180 185 190 

Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly Ile 
195 200 205 

Phe Gln Phe Thr Ser Thr Ser Thr Gly Gln Leu Tyr Glu Met Phe Ser 
210 215 220 

Ser Val Met Lys His Leu Pro Gly Pro Gln Gln Gln Ala Phe Gln Leu 
225 230 235 240 

Leu Gln Gly Leu Glu Asp Phe Ile Ala Lys Lys Val Glu His Asn Gln 
245 250 255 

Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe Ile Asp Ser Phe Leu 
260 265 270 

Ile Arg Met Gln Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu 
275 280 285 

Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe Ile Gly Gly Thr Glu 
290 295 300 

Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His 
305 310 315 320 

Pro Glu Val Glu Ala Lys Val His Glu Glu Ile Asp Arg Val Ile Gly 
325 330 335 

Lys Asn Arg Gln Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met 
340 345 350 

Glu Ala Val Ile His Glu Ile Gln Arg Phe Gly Asp Val Ile Pro Met 
355 360 365 

Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arg Asp Phe Phe 
370 375 380 

Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg 
385 390 395 400 




Asp Pro Ser Phe Phe Ser Asn Pro Gln Asp Phe Asn Pro Gln His Phe 
405 410 415 

Leu Asn Glu Lys Gly Gln Phe Lys Lys Ser Asp Ala Phe Val Pro Phe 
420 425 430 

Ser Ile Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu 
435 440 445 

Leu Phe Leu Phe Phe Thr Thr Val Met Gln Asn Phe Arg Leu Lys Ser 
450 455 460 

Ser Gln Ser Pro Lys Asp Ile Asp Val Ser Pro Lys His Val Gly Phe 
465 470 475 480 

Ala Thr Ile Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg 
485 490 
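
The relationship between a nucleotide entry and its protein entry (here <400>1 and <400>2) can be checked mechanically. The sketch below is illustrative only and is not part of the listing; it assumes the standard genetic code, the helper name `translate` is hypothetical, and the Ala and Ser codons are read as gct and tca, consistent with the translation shown under the first line of <400>1.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order t, c, a, g.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence (spaces allowed) up to the first stop codon."""
    dna = "".join(dna.lower().split())
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of <400>1: Met Ala Ala Ser Gly Met Leu Leu
print(translate("atg gct gct tca gga atg tta tta"))  # MAASGMLL

# The CDS feature (1)..(1482) should code for the 494 residues given in <211> of <400>2.
cds_start, cds_end = 1, 1482
print((cds_end - cds_start + 1) // 3)  # 494
```

The three remaining nucleotides of the 1485-nucleotide entry are the tga stop codon, which is why the CDS feature ends at position 1482.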



<210> 3 

<211> 1419 

<212> DNA 

<213> Human cells 

<220> 

<221> CDS 

<222> (1)..(1416) 

<400> 3 

atg gct cgt caa gtt cat tct tct tgg aat ctg ccc cca ggc cct ttc 48 
Met Ala Arg Gln Val His Ser Ser Trp Asn Leu Pro Pro Gly Pro Phe 
1 5 10 15 

ccg ctt ccc atc atc ggg aac ctc ttc cag ttg gaa ttg aag aat att 96 
Pro Leu Pro Ile Ile Gly Asn Leu Phe Gln Leu Glu Leu Lys Asn Ile 
20 25 30 

ccc aag tcc ttc acc cgg ttg gcc cag cgc ttc ggg ccg gtg ttc acg 144 
Pro Lys Ser Phe Thr Arg Leu Ala Gln Arg Phe Gly Pro Val Phe Thr 
35 40 45 

ctg tac gtg ggc tcg cag cgc atg gtg gtg atg cac ggc tac aag gcg 192 
Leu Tyr Val Gly Ser Gln Arg Met Val Val Met His Gly Tyr Lys Ala 
50 55 60 

gtg aag gaa gcg ctg ctg gac tac aag gac gag ttc tcg ggc aga ggc 240 
Val Lys Glu Ala Leu Leu Asp Tyr Lys Asp Glu Phe Ser Gly Arg Gly 
65 70 75 80 

gac ctc ccc gcg ttc cat gcg cac agg gac agg gga atc att ttt aat 288 
Asp Leu Pro Ala Phe His Ala His Arg Asp Arg Gly Ile Ile Phe Asn 
85 90 95 

aat gga cct acc tgg aag gac atc cgg cgg ttt tcc ctg acc acc ctc 336 
Asn Gly Pro Thr Trp Lys Asp Ile Arg Arg Phe Ser Leu Thr Thr Leu 
100 105 110 

cgg aac tat ggg atg ggg aaa cag ggc aat gag agc cgg atc cag agg 384 
Arg Asn Tyr Gly Met Gly Lys Gln Gly Asn Glu Ser Arg Ile Gln Arg 
115 120 125 

gag gcc cac ttc ctg ctg gaa gca ctc agg aag acc caa ggc cag cct 432 
Glu Ala His Phe Leu Leu Glu Ala Leu Arg Lys Thr Gln Gly Gln Pro 
130 135 140 

ttc gac ccc acc ttc ctc atc ggc tgc gcg ccc tgc aac gtc ata gcc 480 
Phe Asp Pro Thr Phe Leu Ile Gly Cys Ala Pro Cys Asn Val Ile Ala 
145 150 155 160 

gac atc ctc ttc cgc aag cat ttt gac tac aat gat gag aag ttt cta 528 
Asp Ile Leu Phe Arg Lys His Phe Asp Tyr Asn Asp Glu Lys Phe Leu 
165 170 175 

agg ctg atg tat ttg ttt aat gag aac ttc cac cta ctc agc act ccc 576 
Arg Leu Met Tyr Leu Phe Asn Glu Asn Phe His Leu Leu Ser Thr Pro 
180 185 190 

tgg ctc cag ctt tac aat aat ttt ccc agc ttt cta cac tac ttg cct 624 
Trp Leu Gln Leu Tyr Asn Asn Phe Pro Ser Phe Leu His Tyr Leu Pro 
195 200 205 

gga agc cac aga aaa gtc ata aaa aat gtg gct gaa gta aaa gag tat 672 
Gly Ser His Arg Lys Val Ile Lys Asn Val Ala Glu Val Lys Glu Tyr 
210 215 220 

gtg tct gaa agg gtg aag gag cac cat caa tct ctg gac ccc aac tgt 720 
Val Ser Glu Arg Val Lys Glu His His Gln Ser Leu Asp Pro Asn Cys 
225 230 235 240 

ccc cgg gac ctc acc gac tgc ctg ctc gtg gaa atg gag aag gaa aag 768 
Pro Arg Asp Leu Thr Asp Cys Leu Leu Val Glu Met Glu Lys Glu Lys 
245 250 255 

cac agt gca gag cgc ttg tac aca atg gac ggt atc acc gtg act gtg 816 
His Ser Ala Glu Arg Leu Tyr Thr Met Asp Gly Ile Thr Val Thr Val 
260 265 270 

gcc gac ctg ttc ttt gcg ggg aca gag acc acc agc aca act ctg aga 864 
Ala Asp Leu Phe Phe Ala Gly Thr Glu Thr Thr Ser Thr Thr Leu Arg 
275 280 285 

tat ggg ctc ctg att ctc atg aaa tac cct gag atc gaa gag aag ctc 912 
Tyr Gly Leu Leu Ile Leu Met Lys Tyr Pro Glu Ile Glu Glu Lys Leu 
290 295 300 

cat gaa gaa att gac agg gtg att ggg cca agc cga atc cct gcc atc 960 
His Glu Glu Ile Asp Arg Val Ile Gly Pro Ser Arg Ile Pro Ala Ile 
305 310 315 320 

aag gat agg caa gag atg ccc tac atg gat gct gtg gtg cat gag att 1008 
Lys Asp Arg Gln Glu Met Pro Tyr Met Asp Ala Val Val His Glu Ile 
325 330 335 

cag cgg ttc atc acc ctc gtg ccc tcc aac ctg ccc cat gaa gca acc 1056 
Gln Arg Phe Ile Thr Leu Val Pro Ser Asn Leu Pro His Glu Ala Thr 
340 345 350 

cga gac acc att ttc aga gga tac ctc atc ccc aag ggc aca gtc gta 1104 
Arg Asp Thr Ile Phe Arg Gly Tyr Leu Ile Pro Lys Gly Thr Val Val 
355 360 365 

gtg cca act ctg gac tct gtt ttg tat gac aac caa gaa ttt cct gat 1152 
Val Pro Thr Leu Asp Ser Val Leu Tyr Asp Asn Gln Glu Phe Pro Asp 
370 375 380 

cca gaa aag ttt aag cca gaa cac ttc ctg aat gaa aat gga aag ttc 1200 
Pro Glu Lys Phe Lys Pro Glu His Phe Leu Asn Glu Asn Gly Lys Phe 
385 390 395 400 

aag tac agt gac tat ttc aag cca ttt tcc aca gga aaa cga gtg tgt 1248 
Lys Tyr Ser Asp Tyr Phe Lys Pro Phe Ser Thr Gly Lys Arg Val Cys 
405 410 415 

gct gga gaa ggc ctg gct cgc atg gag ttg ttt ctt ttg ttg tgt gcc 1296 
Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Leu Leu Cys Ala 
420 425 430 

att ttg cag cat ttt aat ttg aag cct ctc gtt gac cca aag gat atc 1344 
Ile Leu Gln His Phe Asn Leu Lys Pro Leu Val Asp Pro Lys Asp Ile 
435 440 445 

gac ctc agc cct ata cat att ggg ttt ggc tgt atc cca cca cgt tac 1392 
Asp Leu Ser Pro Ile His Ile Gly Phe Gly Cys Ile Pro Pro Arg Tyr 
450 455 460 

aaa ctc tgt gtc att ccc cgc tca tga 1419 
Lys Leu Cys Val Ile Pro Arg Ser 
465 470 



<210> 4 

<211> 472 

<212> PRT 

<213> Human cells 

<400> 4 

Met Ala Arg Gln Val His Ser Ser Trp Asn Leu Pro Pro Gly Pro Phe 
1 5 10 15 

Pro Leu Pro Ile Ile Gly Asn Leu Phe Gln Leu Glu Leu Lys Asn Ile 
20 25 30 

Pro Lys Ser Phe Thr Arg Leu Ala Gln Arg Phe Gly Pro Val Phe Thr 
35 40 45 

Leu Tyr Val Gly Ser Gln Arg Met Val Val Met His Gly Tyr Lys Ala 
50 55 60 

Val Lys Glu Ala Leu Leu Asp Tyr Lys Asp Glu Phe Ser Gly Arg Gly 
65 70 75 80 

Asp Leu Pro Ala Phe His Ala His Arg Asp Arg Gly Ile Ile Phe Asn 
85 90 95 

Asn Gly Pro Thr Trp Lys Asp Ile Arg Arg Phe Ser Leu Thr Thr Leu 
100 105 110 

Arg Asn Tyr Gly Met Gly Lys Gln Gly Asn Glu Ser Arg Ile Gln Arg 
115 120 125 

Glu Ala His Phe Leu Leu Glu Ala Leu Arg Lys Thr Gln Gly Gln Pro 
130 135 140 

Phe Asp Pro Thr Phe Leu Ile Gly Cys Ala Pro Cys Asn Val Ile Ala 
145 150 155 160 

Asp Ile Leu Phe Arg Lys His Phe Asp Tyr Asn Asp Glu Lys Phe Leu 
165 170 175 

Arg Leu Met Tyr Leu Phe Asn Glu Asn Phe His Leu Leu Ser Thr Pro 
180 185 190 

Trp Leu Gln Leu Tyr Asn Asn Phe Pro Ser Phe Leu His Tyr Leu Pro 
195 200 205 

Gly Ser His Arg Lys Val Ile Lys Asn Val Ala Glu Val Lys Glu Tyr 
210 215 220 

Val Ser Glu Arg Val Lys Glu His His Gln Ser Leu Asp Pro Asn Cys 
225 230 235 240 

Pro Arg Asp Leu Thr Asp Cys Leu Leu Val Glu Met Glu Lys Glu Lys 
245 250 255 

His Ser Ala Glu Arg Leu Tyr Thr Met Asp Gly Ile Thr Val Thr Val 
260 265 270 

Ala Asp Leu Phe Phe Ala Gly Thr Glu Thr Thr Ser Thr Thr Leu Arg 
275 280 285 

Tyr Gly Leu Leu Ile Leu Met Lys Tyr Pro Glu Ile Glu Glu Lys Leu 
290 295 300 

His Glu Glu Ile Asp Arg Val Ile Gly Pro Ser Arg Ile Pro Ala Ile 
305 310 315 320 

Lys Asp Arg Gln Glu Met Pro Tyr Met Asp Ala Val Val His Glu Ile 
325 330 335 

Gln Arg Phe Ile Thr Leu Val Pro Ser Asn Leu Pro His Glu Ala Thr 
340 345 350 

Arg Asp Thr Ile Phe Arg Gly Tyr Leu Ile Pro Lys Gly Thr Val Val 
355 360 365 

Val Pro Thr Leu Asp Ser Val Leu Tyr Asp Asn Gln Glu Phe Pro Asp 
370 375 380 

Pro Glu Lys Phe Lys Pro Glu His Phe Leu Asn Glu Asn Gly Lys Phe 
385 390 395 400 

Lys Tyr Ser Asp Tyr Phe Lys Pro Phe Ser Thr Gly Lys Arg Val Cys 
405 410 415 

Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Leu Leu Cys Ala 
420 425 430 

Ile Leu Gln His Phe Asn Leu Lys Pro Leu Val Asp Pro Lys Asp Ile 
435 440 445 

Asp Leu Ser Pro Ile His Ile Gly Phe Gly Cys Ile Pro Pro Arg Tyr 
450 455 460 

Lys Leu Cys Val Ile Pro Arg Ser 
465 470 



<210> 5 

<211> 1438 

<212> DNA 

<213> Human cells 

<220> 

<221> CDS 

<222> (1)..(1416) 

<400> 5 

atg gct cga caa tct tct gga cga gga aaa ctc cct cct ggc cct act 48 
Met Ala Arg Gln Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly Pro Thr 
1 5 10 15 

cct ctc cca gtg att gga aat atc cta cag ata gat att aag gat gtc 96 
Pro Leu Pro Val Ile Gly Asn Ile Leu Gln Ile Asp Ile Lys Asp Val 
20 25 30 

agc aaa tcc tta acc aat ctc tca aaa atc tat ggc cct gtg ttc act 144 
Ser Lys Ser Leu Thr Asn Leu Ser Lys Ile Tyr Gly Pro Val Phe Thr 
35 40 45 

ctg tat ttt ggc ctg gaa cgc atg gtg gtg ctg cat gga tat gaa gtg 192 
Leu Tyr Phe Gly Leu Glu Arg Met Val Val Leu His Gly Tyr Glu Val 
50 55 60 

gtg aag gaa gcc ctg att gat ctt gga gag gag ttt tct gga aga ggc 240 
Val Lys Glu Ala Leu Ile Asp Leu Gly Glu Glu Phe Ser Gly Arg Gly 
65 70 75 80 

cat ttc cca ctg gct gaa aga gct aac aga gga ttt gga atc gtt ttc 288 
His Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly Ile Val Phe 
85 90 95 

agc aat gga aag aga tgg aag gag atc cgg cgt ttc tcc ctc atg acg 336 
Ser Asn Gly Lys Arg Trp Lys Glu Ile Arg Arg Phe Ser Leu Met Thr 
100 105 110 

ctg cgg aat ttt ggg atg ggg aag agg agc att gag gac cgt gtt caa 384 
Leu Arg Asn Phe Gly Met Gly Lys Arg Ser Ile Glu Asp Arg Val Gln 
115 120 125 

gag gaa gcc cgc tgc ctt gtg gag gag ttg aga aaa acc aag gct tca 432 
Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys Ala Ser 
130 135 140 

ccc tgt gat ccc act ttc atc ctg ggc tgt gct ccc tgc aat gtg atc 480 
Pro Cys Asp Pro Thr Phe Ile Leu Gly Cys Ala Pro Cys Asn Val Ile 
145 150 155 160 

tgc tcc att att ttc cag aaa cgt ttc gat tat aaa gat cag caa ttt 528 
Cys Ser Ile Ile Phe Gln Lys Arg Phe Asp Tyr Lys Asp Gln Gln Phe 
165 170 175 

ctt aac ttg atg gaa aaa ttg aat gaa aac atc agg att gta agc acc 576 
Leu Asn Leu Met Glu Lys Leu Asn Glu Asn Ile Arg Ile Val Ser Thr 
180 185 190 

ccc tgg atc cag ata tgc aat aat ttt ccc act atc att gat tat ttc 624 
Pro Trp Ile Gln Ile Cys Asn Asn Phe Pro Thr Ile Ile Asp Tyr Phe 
195 200 205 

ccg gga acc cat aac aaa tta ctt aaa aac ctt gct ttt atg gaa agt 672 
Pro Gly Thr His Asn Lys Leu Leu Lys Asn Leu Ala Phe Met Glu Ser 
210 215 220 

gat att ttg gag aaa gta aaa gaa cac caa gaa tcg atg gac atc aac 720 
Asp Ile Leu Glu Lys Val Lys Glu His Gln Glu Ser Met Asp Ile Asn 
225 230 235 240 

aac cct cgg gac ttt att gat tgc ttc ctg atc aaa atg gag aag gaa 768 
Asn Pro Arg Asp Phe Ile Asp Cys Phe Leu Ile Lys Met Glu Lys Glu 
245 250 255 

aag caa aac caa cag tct gaa ttc act att gaa aac ttg gta atc act 816 
Lys Gln Asn Gln Gln Ser Glu Phe Thr Ile Glu Asn Leu Val Ile Thr 
260 265 270 

gca gct gac tta ctt gga gct ggg aca gag aca aca agc aca acc ctg 864 
Ala Ala Asp Leu Leu Gly Ala Gly Thr Glu Thr Thr Ser Thr Thr Leu 
275 280 285 

aga tat gct ctc ctt ctc ctg ctg aag cac cca gag gtc aca gct aaa 912 
Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr Ala Lys 
290 295 300 

gtc cag gaa gag att gaa cgt gtc gtt ggc aga aac cgg agc ccc tgc 960 
Val Gln Glu Glu Ile Glu Arg Val Val Gly Arg Asn Arg Ser Pro Cys 
305 310 315 320 

atg cag gac agg ggc cac atg ccc tac aca gat gct gtg gtg cac gag 1008 
Met Gln Asp Arg Gly His Met Pro Tyr Thr Asp Ala Val Val His Glu 
325 330 335 

gtc cag aga tac atc gac ctc atc ccc acc agc ctg ccc cat gca gtg 1056 
Val Gln Arg Tyr Ile Asp Leu Ile Pro Thr Ser Leu Pro His Ala Val 
340 345 350 

acc tgt gac gtt aaa ttc aga aac tac ctc att ccc aag ggc aca acc 1104 
Thr Cys Asp Val Lys Phe Arg Asn Tyr Leu Ile Pro Lys Gly Thr Thr 
355 360 365 

ata tta act tcc ctc act tct gtg cta cat gac aac aaa gaa ttt ccc 1152 
Ile Leu Thr Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu Phe Pro 
370 375 380 

aac cca gag atg ttt gac cct cgt cac ttt ctg gat gaa ggt gga aat 1200 
Asn Pro Glu Met Phe Asp Pro Arg His Phe Leu Asp Glu Gly Gly Asn 
385 390 395 400 

ttt aag aaa agt aac tac ttc atg cct ttc tca gca gga aaa cgg att 1248 
Phe Lys Lys Ser Asn Tyr Phe Met Pro Phe Ser Ala Gly Lys Arg Ile 
405 410 415 

tgt gtg gga gag ggc ctg gcc cgc atg gag ctg ttt tta ttc ctg acc 1296 
Cys Val Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe Leu Thr 
420 425 430 

ttc att tta cag aac ttt aac ctg aaa tct ctg att gac cca aag gac 1344 
Phe Ile Leu Gln Asn Phe Asn Leu Lys Ser Leu Ile Asp Pro Lys Asp 
435 440 445 

ctt gac aca act cct gtt gtc aat gga ttt gct tct gtc ccg ccc ttc 1392 
Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro Pro Phe 
450 455 460 

tat cag ctg tgc ttc att cct gtc tgaagaagca cagatggtct gg 1438 
Tyr Gln Leu Cys Phe Ile Pro Val 
465 470 



<210> 6 

<211> 472 

<212> PRT 

<213> Human cells 

<400> 6 

Met Ala Arg Gin Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly Pro Thr 
15 10 15 

Pro Leu Pro Val He Gly Asn He Leu Gin He Asp He Lys Asp Val 
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20 



25 



30 



Ser Lys Ser Leu Thr Asn Leu Ser Lys lie Tyr Gly Pro Val Phe Thr 
35 40 45 

Leu Tyr Phe Gly Leu Glu Arg Met Val Val Leu His Gly Tyr Glu Val 
50 55 60 

Val Lys Glu Ala Leu lie Asp Leu Gly Glu Glu Phe Ser Gly Arg Gly 
65 70 75 80 

His Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly lie Val Phe 
85 90 95 

Ser Asn Gly Lys Arg Trp Lys Glu lie Arg Arg Phe Ser Leu Met Thr 
100 105 110 

Leu Arg Asn Phe Gly Met Gly Lys Arg Ser lie Glu Asp Arg Val Gin 
115 120 125 

Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys Ala Ser 
130 135 140 

Pro Cys Asp Pro Thr Phe lie Leu Gly Cys Ala Pro Cys Asn Val lie 
145 150 155 160 

Cys Ser lie lie Phe Gin Lys Arg Phe Asp Tyr Lys Asp Gin Gin Phe 
165 170 175 

Leu Asn Leu Met Glu Lys Leu Asn Glu Asn He Arg He Val Ser Thr 
180 185 190 

Pro Trp He Gin He Cys Asn Asn Phe Pro Thr He He Asp Tyr Phe 
195 200 205 



Pro Gly Thr His Asn Lys Leu Leu Lys Asn Leu Ala Phe Met Glu Ser 
210 215 220 



Asp Ile Leu Glu Lys Val Lys Glu His Gln Glu Ser Met Asp Ile Asn
225 230 235 240

Asn Pro Arg Asp Phe Ile Asp Cys Phe Leu Ile Lys Met Glu Lys Glu
245 250 255 

Lys Gln Asn Gln Gln Ser Glu Phe Thr Ile Glu Asn Leu Val Ile Thr
260 265 270 

Ala Ala Asp Leu Leu Gly Ala Gly Thr Glu Thr Thr Ser Thr Thr Leu 
275 280 285 

Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr Ala Lys 
290 295 300 

Val Gln Glu Glu Ile Glu Arg Val Val Gly Arg Asn Arg Ser Pro Cys
305 310 315 320

Met Gln Asp Arg Gly His Met Pro Tyr Thr Asp Ala Val Val His Glu
325 330 335

Val Gln Arg Tyr Ile Asp Leu Ile Pro Thr Ser Leu Pro His Ala Val
340 345 350

Thr Cys Asp Val Lys Phe Arg Asn Tyr Leu Ile Pro Lys Gly Thr Thr
355 360 365

Ile Leu Thr Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu Phe Pro
370 375 380 

Asn Pro Glu Met Phe Asp Pro Arg His Phe Leu Asp Glu Gly Gly Asn 
385 390 395 400 

Phe Lys Lys Ser Asn Tyr Phe Met Pro Phe Ser Ala Gly Lys Arg Ile
405 410 415 



Cys Val Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe Leu Thr 
420 425 430 



Phe Ile Leu Gln Asn Phe Asn Leu Lys Ser Leu Ile Asp Pro Lys Asp
435 440 445

Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro Pro Phe 
450 455 460 

Tyr Gln Leu Cys Phe Ile Pro Val
465 470 



<210> 7 

<211> 1485 

<212> DNA 

<213> Human cells 



<220> 

<221> CDS 

<222> (1)..(1482)



<400> 7 

atg ctg gcc tca ggg atg ctt ctg gtg gcc ttg ctg gtc tgc ctg act 48
Met Leu Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr
1 5 10 15



gtg atg gtc ttg atg tct gtt tgg cag cag agg aag agc aag ggg aag 96
Val Met Val Leu Met Ser Val Trp Gln Gln Arg Lys Ser Lys Gly Lys
20 25 30

ctg cct ccg gga ccc acc cca ttg ccc ttc att gga aac tac ctg cag 144
Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe Ile Gly Asn Tyr Leu Gln
35 40 45

ctg aac aca gag cag atg tac aac tcc ctc atg aag atc agt gag cgc 192
Leu Asn Thr Glu Gln Met Tyr Asn Ser Leu Met Lys Ile Ser Glu Arg
50 55 60

tat ggc ccc gtg ttc acc att cac ttg ggg ccc cgg cgg gtc gtg gtg 240
Tyr Gly Pro Val Phe Thr Ile His Leu Gly Pro Arg Arg Val Val Val
65 70 75 80

ctg tgt gga cat gat gcc gtc agg gag gct ctg gtg gac cag gct gag 288
Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gln Ala Glu
85 90 95 



gag ttc agc ggg cga ggc gag caa gcc acc ttc gac tgg gtc ttc aaa 336
Glu Phe Ser Gly Arg Gly Glu Gln Ala Thr Phe Asp Trp Val Phe Lys
100 105 110

ggc tat ggc gtg gta ttc agc aac ggg gag cgc gcc aag cag ctc cgg 384
Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gln Leu Arg
115 120 125 



cgc ttc tcc atc gcc acc ctg cgg gac ttc ggg gtg ggc aag cga ggc 432
Arg Phe Ser Ile Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly
130 135 140

atc gag gag cgc atc cag gag gag gcg ggc ttc ctc atc gac gcc ctc 480
Ile Glu Glu Arg Ile Gln Glu Glu Ala Gly Phe Leu Ile Asp Ala Leu
145 150 155 160

cgg ggc act ggc ggc gcc aat atc gat ccc acc ttc ttc ctg agc cgc 528
Arg Gly Thr Gly Gly Ala Asn Ile Asp Pro Thr Phe Phe Leu Ser Arg
165 170 175 



aca gtc tcc aat gtc atc agc tcc att gtc ttt ggg gac cgc ttt gac 576
Thr Val Ser Asn Val Ile Ser Ser Ile Val Phe Gly Asp Arg Phe Asp
180 185 190

tat aag gac aaa gag ttc ctg tca ctg ttg cgc atg atg cta gga atc 624
Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly Ile
195 200 205

ttc cag ttc acg tca acc tcc acg ggg cag ctc tat gag atg ttc tct 672
Phe Gln Phe Thr Ser Thr Ser Thr Gly Gln Leu Tyr Glu Met Phe Ser
210 215 220

tcg gtg atg aaa cac ctg cca gga cca cag caa cag gcc ttt cag ttg 720
Ser Val Met Lys His Leu Pro Gly Pro Gln Gln Gln Ala Phe Gln Leu
225 230 235 240

ctg caa ggg ctg gag gac ttc ata gcc aag aag gtg gag cac aac cag 768
Leu Gln Gly Leu Glu Asp Phe Ile Ala Lys Lys Val Glu His Asn Gln
245 250 255

cgc acg ctg gat ccc aat tcc cca cgg gac ttc att gac tcc ttt ctc 816
Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe Ile Asp Ser Phe Leu
260 265 270

atc cgc atg cag gag gag gag aag aac ccc aac acg gag ttc tac ttg 864
Ile Arg Met Gln Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu
275 280 285

aaa aac ctg gtg atg acc acg ttg aac ctc ttc att ggg ggc acc gag 912
Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe Ile Gly Gly Thr Glu
290 295 300

acc gtc agc acc acc ctg cgc tat ggc ttc ttg ctg ctc atg aag cac 960
Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His 
305 310 315 320 



cca gag gtg gag gcc aag gtc cat gag gag att gac aga gtg atc ggc 1008
Pro Glu Val Glu Ala Lys Val His Glu Glu Ile Asp Arg Val Ile Gly
325 330 335

aag aac cgg cag ccc aag ttt gag gac cgg gcc aag atg ccc tac atg 1056
Lys Asn Arg Gln Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met
340 345 350

gag gca gtg atc cac gag atc caa aga ttt gga gac gtg atc ccc atg 1104
Glu Ala Val Ile His Glu Ile Gln Arg Phe Gly Asp Val Ile Pro Met
355 360 365

agt ttg gcc cgc aga gtc aaa aag gac acc aag ttt cgg gat ttc ttc 1152
Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arg Asp Phe Phe
370 375 380

ctc cct aag ggc acc gaa gtg tac cct atg ctg ggc tct gtg ctg aga 1200
Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg
385 390 395 400 

gac ccc agt ttc ttc tcc aac ccc cag gac ttc aat ccc cag cac ttc 1248
Asp Pro Ser Phe Phe Ser Asn Pro Gln Asp Phe Asn Pro Gln His Phe
405 410 415

ctg aat gag aag ggg cag ttt aag aag agt gat gct ttt gtg ccc ttt 1296
Leu Asn Glu Lys Gly Gln Phe Lys Lys Ser Asp Ala Phe Val Pro Phe
420 425 430

tcc atc gga aag cgg aac tgt ttc gga gaa ggc ctg gcc aga atg gag 1344
Ser Ile Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu
435 440 445

ctc ttt ctc ttc ttc acc acc gtc atg cag aac ttc cgc ctc aag tcc 1392
Leu Phe Leu Phe Phe Thr Thr Val Met Gln Asn Phe Arg Leu Lys Ser
450 455 460

tcc cag tca cct aag gac att gac gtg tcc ccc aaa cac gtg ggc ttt 1440
Ser Gln Ser Pro Lys Asp Ile Asp Val Ser Pro Lys His Val Gly Phe
465 470 475 480

gcc acg atc cca cga aac tac acc atg agc ttc ctg ccc cgc tga 1485
Ala Thr Ile Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg
485 490 



<210> 8 

<211> 494 

<212> PRT 

<213> Human cells 

<400> 8 

Met Leu Ala Ser Gly Met Leu Leu Val Ala Leu Leu Val Cys Leu Thr 
1 5 10 15

Val Met Val Leu Met Ser Val Trp Gln Gln Arg Lys Ser Lys Gly Lys
20 25 30

Leu Pro Pro Gly Pro Thr Pro Leu Pro Phe Ile Gly Asn Tyr Leu Gln
35 40 45 

Leu Asn Thr Glu Gln Met Tyr Asn Ser Leu Met Lys Ile Ser Glu Arg
50 55 60

Tyr Gly Pro Val Phe Thr Ile His Leu Gly Pro Arg Arg Val Val Val
65 70 75 80

Leu Cys Gly His Asp Ala Val Arg Glu Ala Leu Val Asp Gln Ala Glu
85 90 95

Glu Phe Ser Gly Arg Gly Glu Gln Ala Thr Phe Asp Trp Val Phe Lys
100 105 110

Gly Tyr Gly Val Val Phe Ser Asn Gly Glu Arg Ala Lys Gln Leu Arg
115 120 125

Arg Phe Ser Ile Ala Thr Leu Arg Asp Phe Gly Val Gly Lys Arg Gly
130 135 140

Ile Glu Glu Arg Ile Gln Glu Glu Ala Gly Phe Leu Ile Asp Ala Leu
145 150 155 160

Arg Gly Thr Gly Gly Ala Asn Ile Asp Pro Thr Phe Phe Leu Ser Arg
165 170 175

Thr Val Ser Asn Val Ile Ser Ser Ile Val Phe Gly Asp Arg Phe Asp
180 185 190

Tyr Lys Asp Lys Glu Phe Leu Ser Leu Leu Arg Met Met Leu Gly Ile
195 200 205

Phe Gln Phe Thr Ser Thr Ser Thr Gly Gln Leu Tyr Glu Met Phe Ser
210 215 220

Ser Val Met Lys His Leu Pro Gly Pro Gln Gln Gln Ala Phe Gln Leu
225 230 235 240 






Leu Gln Gly Leu Glu Asp Phe Ile Ala Lys Lys Val Glu His Asn Gln
245 250 255

Arg Thr Leu Asp Pro Asn Ser Pro Arg Asp Phe Ile Asp Ser Phe Leu
260 265 270

Ile Arg Met Gln Glu Glu Glu Lys Asn Pro Asn Thr Glu Phe Tyr Leu
275 280 285

Lys Asn Leu Val Met Thr Thr Leu Asn Leu Phe Ile Gly Gly Thr Glu
290 295 300

Thr Val Ser Thr Thr Leu Arg Tyr Gly Phe Leu Leu Leu Met Lys His
305 310 315 320

Pro Glu Val Glu Ala Lys Val His Glu Glu Ile Asp Arg Val Ile Gly
325 330 335

Lys Asn Arg Gln Pro Lys Phe Glu Asp Arg Ala Lys Met Pro Tyr Met
340 345 350

Glu Ala Val Ile His Glu Ile Gln Arg Phe Gly Asp Val Ile Pro Met
355 360 365

Ser Leu Ala Arg Arg Val Lys Lys Asp Thr Lys Phe Arg Asp Phe Phe
370 375 380

Leu Pro Lys Gly Thr Glu Val Tyr Pro Met Leu Gly Ser Val Leu Arg
385 390 395 400

Asp Pro Ser Phe Phe Ser Asn Pro Gln Asp Phe Asn Pro Gln His Phe
405 410 415

Leu Asn Glu Lys Gly Gln Phe Lys Lys Ser Asp Ala Phe Val Pro Phe
420 425 430 



Ser Ile Gly Lys Arg Asn Cys Phe Gly Glu Gly Leu Ala Arg Met Glu
435 440 445

Leu Phe Leu Phe Phe Thr Thr Val Met Gln Asn Phe Arg Leu Lys Ser
450 455 460

Ser Gln Ser Pro Lys Asp Ile Asp Val Ser Pro Lys His Val Gly Phe
465 470 475 480

Ala Thr Ile Pro Arg Asn Tyr Thr Met Ser Phe Leu Pro Arg
485 490



<210> 9 
<211> 1479 
<212> DNA 
<213> Human cells 

<220> 

<221> CDS 

<222> (1)..(1479)

<400> 9 

atg tct gcc ctc gga gtc acc gtg gcc ctg ctg gtg tgg gcg gcc ttc 48
Met Ser Ala Leu Gly Val Thr Val Ala Leu Leu Val Trp Ala Ala Phe
1 5 10 15

ctc ctg ctg gtg tcc atg tgg agg cag gtg cac agc agc tgg aat ctg 96
Leu Leu Leu Val Ser Met Trp Arg Gln Val His Ser Ser Trp Asn Leu
20 25 30

ccc cca ggc cct ttc ccg ctt ccc atc atc ggg aac ctc ttc cag ttg 144
Pro Pro Gly Pro Phe Pro Leu Pro Ile Ile Gly Asn Leu Phe Gln Leu
35 40 45

gaa ttg aag aat att ccc aag tcc ttc acc cgg ttg gcc cag cgc ttc 192
Glu Leu Lys Asn Ile Pro Lys Ser Phe Thr Arg Leu Ala Gln Arg Phe
50 55 60

ggg ccg gtg ttc acg ctg tac gtg ggc tcg cag cgc atg gtg gtg atg 240
Gly Pro Val Phe Thr Leu Tyr Val Gly Ser Gln Arg Met Val Val Met
65 70 75 80 

cac ggc tac aag gcg gtg aag gaa gcg ctg ctg gac tac aag gac gag 288 
His Gly Tyr Lys Ala Val Lys Glu Ala Leu Leu Asp Tyr Lys Asp Glu 
85 90 95 

ttc tcg ggc aga ggc gac ctc ccc gcg ttc cat gcg cac agg gac agg 336
Phe Ser Gly Arg Gly Asp Leu Pro Ala Phe His Ala His Arg Asp Arg
100 105 110

gga atc att ttt aat aat gga cct acc tgg aag gac atc cgg cgg ttt 384
Gly Ile Ile Phe Asn Asn Gly Pro Thr Trp Lys Asp Ile Arg Arg Phe
115 120 125

tcc ctg acc acc ctc cgg aac tat ggg atg ggg aaa cag ggc aat gag 432
Ser Leu Thr Thr Leu Arg Asn Tyr Gly Met Gly Lys Gln Gly Asn Glu
130 135 140

agc cgg atc cag agg gag gcc cac ttc ctg ctg gaa gca ctc agg aag 480
Ser Arg Ile Gln Arg Glu Ala His Phe Leu Leu Glu Ala Leu Arg Lys
145 150 155 160

acc caa ggc cag cct ttc gac ccc acc ttc ctc atc ggc tgc gcg ccc 528
Thr Gln Gly Gln Pro Phe Asp Pro Thr Phe Leu Ile Gly Cys Ala Pro
165 170 175

tgc aac gtc ata gcc gac atc ctc ttc cgc aag cat ttt gac tac aat 576
Cys Asn Val Ile Ala Asp Ile Leu Phe Arg Lys His Phe Asp Tyr Asn
180 185 190

gat gag aag ttt cta agg ctg atg tat ttg ttt aat gag aac ttc cac 624
Asp Glu Lys Phe Leu Arg Leu Met Tyr Leu Phe Asn Glu Asn Phe His
195 200 205

cta ctc agc act ccc tgg ctc cag ctt tac aat aat ttt ccc agc ttt 672
Leu Leu Ser Thr Pro Trp Leu Gln Leu Tyr Asn Asn Phe Pro Ser Phe
210 215 220

cta cac tac ttg cct gga agc cac aga aaa gtc ata aaa aat gtg gct 720
Leu His Tyr Leu Pro Gly Ser His Arg Lys Val Ile Lys Asn Val Ala
225 230 235 240 

gaa gta aaa gag tat gtg tct gaa agg gtg aag gag cac cat caa tct 768
Glu Val Lys Glu Tyr Val Ser Glu Arg Val Lys Glu His His Gln Ser
245 250 255

ctg gac ccc aac tgt ccc cgg gac ctc acc gac tgc ctg ctc gtg gaa 816
Leu Asp Pro Asn Cys Pro Arg Asp Leu Thr Asp Cys Leu Leu Val Glu
260 265 270

atg gag aag gaa aag cac agt gca gag cgc ttg tac aca atg gac ggt 864
Met Glu Lys Glu Lys His Ser Ala Glu Arg Leu Tyr Thr Met Asp Gly
275 280 285

atc acc gtg act gtg gcc gac ctg ttc ttt gcg ggg aca gag acc acc 912
Ile Thr Val Thr Val Ala Asp Leu Phe Phe Ala Gly Thr Glu Thr Thr
290 295 300

agc aca act ctg aga tat ggg ctc ctg att ctc atg aaa tac cct gag 960
Ser Thr Thr Leu Arg Tyr Gly Leu Leu Ile Leu Met Lys Tyr Pro Glu
305 310 315 320

atc gaa gag aag ctc cat gaa gaa att gac agg gtg att ggg cca agc 1008
Ile Glu Glu Lys Leu His Glu Glu Ile Asp Arg Val Ile Gly Pro Ser
325 330 335

cga atc cct gcc atc aag gat agg caa gag atg ccc tac atg gat gct 1056
Arg Ile Pro Ala Ile Lys Asp Arg Gln Glu Met Pro Tyr Met Asp Ala
340 345 350

gtg gtg cat gag att cag cgg ttc atc acc ctc gtg ccc tcc aac ctg 1104
Val Val His Glu Ile Gln Arg Phe Ile Thr Leu Val Pro Ser Asn Leu
355 360 365

ccc cat gaa gca acc cga gac acc att ttc aga gga tac ctc atc ccc 1152
Pro His Glu Ala Thr Arg Asp Thr Ile Phe Arg Gly Tyr Leu Ile Pro
370 375 380 






aag ggc aca gtc gta gtg cca act ctg gac tct gtt ttg tat gac aac 1200 
Lys Gly Thr Val Val Val Pro Thr Leu Asp Ser Val Leu Tyr Asp Asn 
385 390 395 400 

caa gaa ttt cct gat cca gaa aag ttt aag cca gaa cac ttc ctg aat 1248 
Gln Glu Phe Pro Asp Pro Glu Lys Phe Lys Pro Glu His Phe Leu Asn
405 410 415

gaa aat gga aag ttc aag tac agt gac tat ttc aag cca ttt tcc aca 1296
Glu Asn Gly Lys Phe Lys Tyr Ser Asp Tyr Phe Lys Pro Phe Ser Thr
420 425 430

gga aaa cga gtg tgt gct gga gaa ggc ctg gct cgc atg gag ttg ttt 1344
Gly Lys Arg Val Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe
435 440 445

ctt ttg ttg tgt gcc att ttg cag cat ttt aat ttg aag cct ctc gtt 1392
Leu Leu Leu Cys Ala Ile Leu Gln His Phe Asn Leu Lys Pro Leu Val
450 455 460

gac cca aag gat atc gac ctc agc cct ata cat att ggg ttt ggc tgt 1440
Asp Pro Lys Asp Ile Asp Leu Ser Pro Ile His Ile Gly Phe Gly Cys
465 470 475 480

atc cca cca cgt tac aaa ctc tgt gtc att ccc cgc tca 1479
Ile Pro Pro Arg Tyr Lys Leu Cys Val Ile Pro Arg Ser
485 490 



<210> 10 

<211> 493 

<212> PRT 

<213> Human cells 

<400> 10 

Met Ser Ala Leu Gly Val Thr Val Ala Leu Leu Val Trp Ala Ala Phe 
1 5 10 15

Leu Leu Leu Val Ser Met Trp Arg Gln Val His Ser Ser Trp Asn Leu
20 25 30

Pro Pro Gly Pro Phe Pro Leu Pro Ile Ile Gly Asn Leu Phe Gln Leu
35 40 45 

Glu Leu Lys Asn Ile Pro Lys Ser Phe Thr Arg Leu Ala Gln Arg Phe
50 55 60

Gly Pro Val Phe Thr Leu Tyr Val Gly Ser Gln Arg Met Val Val Met
65 70 75 80

His Gly Tyr Lys Ala Val Lys Glu Ala Leu Leu Asp Tyr Lys Asp Glu
85 90 95

Phe Ser Gly Arg Gly Asp Leu Pro Ala Phe His Ala His Arg Asp Arg
100 105 110

Gly Ile Ile Phe Asn Asn Gly Pro Thr Trp Lys Asp Ile Arg Arg Phe
115 120 125

Ser Leu Thr Thr Leu Arg Asn Tyr Gly Met Gly Lys Gln Gly Asn Glu
130 135 140

Ser Arg Ile Gln Arg Glu Ala His Phe Leu Leu Glu Ala Leu Arg Lys
145 150 155 160

Thr Gln Gly Gln Pro Phe Asp Pro Thr Phe Leu Ile Gly Cys Ala Pro
165 170 175

Cys Asn Val Ile Ala Asp Ile Leu Phe Arg Lys His Phe Asp Tyr Asn
180 185 190

Asp Glu Lys Phe Leu Arg Leu Met Tyr Leu Phe Asn Glu Asn Phe His
195 200 205

Leu Leu Ser Thr Pro Trp Leu Gln Leu Tyr Asn Asn Phe Pro Ser Phe
210 215 220

Leu His Tyr Leu Pro Gly Ser His Arg Lys Val Ile Lys Asn Val Ala
225 230 235 240 

Glu Val Lys Glu Tyr Val Ser Glu Arg Val Lys Glu His His Gln Ser
245 250 255

Leu Asp Pro Asn Cys Pro Arg Asp Leu Thr Asp Cys Leu Leu Val Glu
260 265 270

Met Glu Lys Glu Lys His Ser Ala Glu Arg Leu Tyr Thr Met Asp Gly
275 280 285

Ile Thr Val Thr Val Ala Asp Leu Phe Phe Ala Gly Thr Glu Thr Thr
290 295 300

Ser Thr Thr Leu Arg Tyr Gly Leu Leu Ile Leu Met Lys Tyr Pro Glu
305 310 315 320

Ile Glu Glu Lys Leu His Glu Glu Ile Asp Arg Val Ile Gly Pro Ser
325 330 335

Arg Ile Pro Ala Ile Lys Asp Arg Gln Glu Met Pro Tyr Met Asp Ala
340 345 350

Val Val His Glu Ile Gln Arg Phe Ile Thr Leu Val Pro Ser Asn Leu
355 360 365

Pro His Glu Ala Thr Arg Asp Thr Ile Phe Arg Gly Tyr Leu Ile Pro
370 375 380 

Lys Gly Thr Val Val Val Pro Thr Leu Asp Ser Val Leu Tyr Asp Asn 
385 390 395 400 

Gln Glu Phe Pro Asp Pro Glu Lys Phe Lys Pro Glu His Phe Leu Asn
405 410 415 



Glu Asn Gly Lys Phe Lys Tyr Ser Asp Tyr Phe Lys Pro Phe Ser Thr
420 425 430

Gly Lys Arg Val Cys Ala Gly Glu Gly Leu Ala Arg Met Glu Leu Phe
435 440 445

Leu Leu Leu Cys Ala Ile Leu Gln His Phe Asn Leu Lys Pro Leu Val
450 455 460

Asp Pro Lys Asp Ile Asp Leu Ser Pro Ile His Ile Gly Phe Gly Cys
465 470 475 480

Ile Pro Pro Arg Tyr Lys Leu Cys Val Ile Pro Arg Ser
485 490



<210> 11 

<211> 1419 

<212> DNA 

<213> Human cells 



<220> 

<221> CDS 

<222> (1)..(1416) 



<400> 11 

atg gct aga cag agc tct ggg aga gga aaa ctc cct cct ggc cct act 48
Met Ala Arg Gln Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly Pro Thr
1 5 10 15

cct ctc cca gtg att gga aat atc cta cag ata gat att aag gat gtc 96
Pro Leu Pro Val Ile Gly Asn Ile Leu Gln Ile Asp Ile Lys Asp Val
20 25 30

agc aaa tcc tta acc aat ctc tca aaa atc tat ggc cct gtg ttc act 144
Ser Lys Ser Leu Thr Asn Leu Ser Lys Ile Tyr Gly Pro Val Phe Thr
35 40 45

ctg tat ttt ggc ctg gaa cgc atg gtg gtg ctg cat gga tat gaa gtg 192
Leu Tyr Phe Gly Leu Glu Arg Met Val Val Leu His Gly Tyr Glu Val
50 55 60

gtg aag gaa gcc ctg att gat ctt gga gag gag ttt tct gga aga ggc 240
Val Lys Glu Ala Leu Ile Asp Leu Gly Glu Glu Phe Ser Gly Arg Gly
65 70 75 80 

cat ttc cca ctg gct gaa aga gct aac aga gga ttt gga atc gtt ttc 288
His Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly Ile Val Phe
85 90 95

agc aat gga aag aga tgg aag gag atc cgg cgt ttc tcc ctc atg acg 336
Ser Asn Gly Lys Arg Trp Lys Glu Ile Arg Arg Phe Ser Leu Met Thr
100 105 110

ctg cgg aat ttt ggg atg ggg aag agg agc att gag gac cgt gtt caa 384
Leu Arg Asn Phe Gly Met Gly Lys Arg Ser Ile Glu Asp Arg Val Gln
115 120 125

gag gaa gcc cgc tgc ctt gtg gag gag ttg aga aaa acc aag gct tca 432
Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys Ala Ser
130 135 140

ccc tgt gat ccc act ttc atc ctg ggc tgt gct ccc tgc aat gtg atc 480
Pro Cys Asp Pro Thr Phe Ile Leu Gly Cys Ala Pro Cys Asn Val Ile
145 150 155 160

tgc tcc att att ttc cag aaa cgt ttc gat tat aaa gat cag caa ttt 528
Cys Ser Ile Ile Phe Gln Lys Arg Phe Asp Tyr Lys Asp Gln Gln Phe
165 170 175

ctt aac ttg atg gaa aaa ttg aat gaa aac atc agg att gta agc acc 576
Leu Asn Leu Met Glu Lys Leu Asn Glu Asn Ile Arg Ile Val Ser Thr
180 185 190

ccc tgg atc cag ata tgc aat aat ttt ccc act atc att gat tat ttc 624
Pro Trp Ile Gln Ile Cys Asn Asn Phe Pro Thr Ile Ile Asp Tyr Phe
195 200 205

ccg gga acc cat aac aaa tta ctt aaa aac ctt gct ttt atg gaa agt 672
Pro Gly Thr His Asn Lys Leu Leu Lys Asn Leu Ala Phe Met Glu Ser
210 215 220



gat att ttg gag aaa gta aaa gaa cac caa gaa tcg atg gac atc aac 720
Asp Ile Leu Glu Lys Val Lys Glu His Gln Glu Ser Met Asp Ile Asn
225 230 235 240

aac cct cgg gac ttt att gat tgc ttc ctg atc aaa atg gag aag gaa 768
Asn Pro Arg Asp Phe Ile Asp Cys Phe Leu Ile Lys Met Glu Lys Glu
245 250 255

aag caa aac caa cag tct gaa ttc act att gaa aac ttg gta atc act 816
Lys Gln Asn Gln Gln Ser Glu Phe Thr Ile Glu Asn Leu Val Ile Thr
260 265 270

gca gct gac tta ctt gga gct ggg aca gag aca aca agc aca acc ctg 864
Ala Ala Asp Leu Leu Gly Ala Gly Thr Glu Thr Thr Ser Thr Thr Leu
275 280 285

aga tat gct ctc ctt ctc ctg ctg aag cac cca gag gtc aca gct aaa 912
Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr Ala Lys
290 295 300

gtc cag gaa gag att gaa cgt gtc gtt ggc aga aac cgg agc ccc tgc 960
Val Gln Glu Glu Ile Glu Arg Val Val Gly Arg Asn Arg Ser Pro Cys
305 310 315 320

atg cag gac agg ggc cac atg ccc tac aca gat gct gtg gtg cac gag 1008
Met Gln Asp Arg Gly His Met Pro Tyr Thr Asp Ala Val Val His Glu
325 330 335

gtc cag aga tac atc gac ctc atc ccc acc agc ctg ccc cat gca gtg 1056
Val Gln Arg Tyr Ile Asp Leu Ile Pro Thr Ser Leu Pro His Ala Val
340 345 350

acc tgt gac gtt aaa ttc aga aac tac ctc att ccc aag ggc aca acc 1104
Thr Cys Asp Val Lys Phe Arg Asn Tyr Leu Ile Pro Lys Gly Thr Thr
355 360 365 



ata tta act tcc ctc act tct gtg cta cat gac aac aaa gaa ttt ccc 1152
Ile Leu Thr Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu Phe Pro
370 375 380

aac cca gag atg ttt gac cct cgt cac ttt ctg gat gaa ggt gga aat 1200
Asn Pro Glu Met Phe Asp Pro Arg His Phe Leu Asp Glu Gly Gly Asn
385 390 395 400

ttt aag aaa agt aac tac ttc atg cct ttc tca gca gga aaa cgg att 1248
Phe Lys Lys Ser Asn Tyr Phe Met Pro Phe Ser Ala Gly Lys Arg Ile
405 410 415

tgt gtg gga gag ggc ctg gcc cgc atg gag ctg ttt tta ttc ctg acc 1296
Cys Val Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe Leu Thr
420 425 430

ttc att tta cag aac ttt aac ctg aaa tct ctg att gac cca aag gac 1344
Phe Ile Leu Gln Asn Phe Asn Leu Lys Ser Leu Ile Asp Pro Lys Asp
435 440 445

ctt gac aca act cct gtt gtc aat gga ttt gct tct gtc ccg ccc ttc 1392
Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro Pro Phe
450 455 460

tat cag ctg tgc ttc att cct gtc tga 1419
Tyr Gln Leu Cys Phe Ile Pro Val
465 470



<210> 12 

<211> 472 

<212> PRT 

<213> Human cells 

<400> 12 

Met Ala Arg Gln Ser Ser Gly Arg Gly Lys Leu Pro Pro Gly Pro Thr
1 5 10 15

Pro Leu Pro Val Ile Gly Asn Ile Leu Gln Ile Asp Ile Lys Asp Val
20 25 30

Ser Lys Ser Leu Thr Asn Leu Ser Lys Ile Tyr Gly Pro Val Phe Thr
35 40 45 

Leu Tyr Phe Gly Leu Glu Arg Met Val Val Leu His Gly Tyr Glu Val 
50 55 60 

Val Lys Glu Ala Leu Ile Asp Leu Gly Glu Glu Phe Ser Gly Arg Gly
65 70 75 80

His Phe Pro Leu Ala Glu Arg Ala Asn Arg Gly Phe Gly Ile Val Phe
85 90 95

Ser Asn Gly Lys Arg Trp Lys Glu Ile Arg Arg Phe Ser Leu Met Thr
100 105 110

Leu Arg Asn Phe Gly Met Gly Lys Arg Ser Ile Glu Asp Arg Val Gln
115 120 125

Glu Glu Ala Arg Cys Leu Val Glu Glu Leu Arg Lys Thr Lys Ala Ser
130 135 140

Pro Cys Asp Pro Thr Phe Ile Leu Gly Cys Ala Pro Cys Asn Val Ile
145 150 155 160

Cys Ser Ile Ile Phe Gln Lys Arg Phe Asp Tyr Lys Asp Gln Gln Phe
165 170 175

Leu Asn Leu Met Glu Lys Leu Asn Glu Asn Ile Arg Ile Val Ser Thr
180 185 190

Pro Trp Ile Gln Ile Cys Asn Asn Phe Pro Thr Ile Ile Asp Tyr Phe
195 200 205 



Pro Gly Thr His Asn Lys Leu Leu Lys Asn Leu Ala Phe Met Glu Ser 
210 215 220 



Asp Ile Leu Glu Lys Val Lys Glu His Gln Glu Ser Met Asp Ile Asn
225 230 235 240 






Asn Pro Arg Asp Phe Ile Asp Cys Phe Leu Ile Lys Met Glu Lys Glu
245 250 255

Lys Gln Asn Gln Gln Ser Glu Phe Thr Ile Glu Asn Leu Val Ile Thr
260 265 270 

Ala Ala Asp Leu Leu Gly Ala Gly Thr Glu Thr Thr Ser Thr Thr Leu 
275 280 285 

Arg Tyr Ala Leu Leu Leu Leu Leu Lys His Pro Glu Val Thr Ala Lys 
290 295 300 

Val Gln Glu Glu Ile Glu Arg Val Val Gly Arg Asn Arg Ser Pro Cys
305 310 315 320

Met Gln Asp Arg Gly His Met Pro Tyr Thr Asp Ala Val Val His Glu
325 330 335

Val Gln Arg Tyr Ile Asp Leu Ile Pro Thr Ser Leu Pro His Ala Val
340 345 350

Thr Cys Asp Val Lys Phe Arg Asn Tyr Leu Ile Pro Lys Gly Thr Thr
355 360 365

Ile Leu Thr Ser Leu Thr Ser Val Leu His Asp Asn Lys Glu Phe Pro
370 375 380

Asn Pro Glu Met Phe Asp Pro Arg His Phe Leu Asp Glu Gly Gly Asn
385 390 395 400

Phe Lys Lys Ser Asn Tyr Phe Met Pro Phe Ser Ala Gly Lys Arg Ile
405 410 415 

Cys Val Gly Glu Gly Leu Ala Arg Met Glu Leu Phe Leu Phe Leu Thr 
420 425 430 



Phe Ile Leu Gln Asn Phe Asn Leu Lys Ser Leu Ile Asp Pro Lys Asp
435 440 445

Leu Asp Thr Thr Pro Val Val Asn Gly Phe Ala Ser Val Pro Pro Phe
450 455 460

Tyr Gln Leu Cys Phe Ile Pro Val
465 470 



<210> 13 
<211> 2034 
<212> DNA 
<213> Human cells 



<220> 

<221> CDS 

<222> (1)..(2031)



<400> 13 

atg gct gac tcc cac gtg gac acc agc tcc acc gtg tcc gag gcg gtg 48
Met Ala Asp Ser His Val Asp Thr Ser Ser Thr Val Ser Glu Ala Val
1 5 10 15

gcc gaa gaa gta tct ctt ttc agc atg acg gac atg att ctg ttt tcg 96
Ala Glu Glu Val Ser Leu Phe Ser Met Thr Asp Met Ile Leu Phe Ser
20 25 30

ctc atc gtg ggt ctc cta acc tac tgg ttc ctc ttc aga aag aaa aaa 144
Leu Ile Val Gly Leu Leu Thr Tyr Trp Phe Leu Phe Arg Lys Lys Lys
35 40 45

gaa gaa gtc ccc gag ttc acc aaa att cag aca ttg acc tcc tct gtc 192
Glu Glu Val Pro Glu Phe Thr Lys Ile Gln Thr Leu Thr Ser Ser Val
50 55 60

aga gag agc agc ttt gtg gaa aag atg aag aaa acg ggg agg aac atc 240
Arg Glu Ser Ser Phe Val Glu Lys Met Lys Lys Thr Gly Arg Asn Ile
65 70 75 80

atc gtg ttc tac ggc tcc cag acg ggg act gca gag gag ttt gcc aac 288
Ile Val Phe Tyr Gly Ser Gln Thr Gly Thr Ala Glu Glu Phe Ala Asn
85 90 95 

cgc ctg tcc aag gac gcc cac cgc tac ggg atg cga ggc atg tca gcg 336
Arg Leu Ser Lys Asp Ala His Arg Tyr Gly Met Arg Gly Met Ser Ala
100 105 110

gac cct gag gag tat gac ctg gcc gac ctg agc agc ctg cca gag atc 384
Asp Pro Glu Glu Tyr Asp Leu Ala Asp Leu Ser Ser Leu Pro Glu Ile
115 120 125

gac aac gcc ctg gtg gtt ttc tgc atg gcc acc tac ggt gag gga gac 432
Asp Asn Ala Leu Val Val Phe Cys Met Ala Thr Tyr Gly Glu Gly Asp
130 135 140

ccc acc gac aat gcc cag gac ttc tac gac tgg ctg cag gag aca gac 480
Pro Thr Asp Asn Ala Gln Asp Phe Tyr Asp Trp Leu Gln Glu Thr Asp
145 150 155 160

gtg gat ctc tct ggg gtc aag ttc gcg gtg ttt ggt ctt ggg aac aag 528
Val Asp Leu Ser Gly Val Lys Phe Ala Val Phe Gly Leu Gly Asn Lys
165 170 175

acc tac gag cac ttc aat gcc atg ggc aag tac gtg gac aag cgg ctg 576
Thr Tyr Glu His Phe Asn Ala Met Gly Lys Tyr Val Asp Lys Arg Leu
180 185 190

gag cag ctc ggc gcc cag cgc atc ttt gag ctg ggg ttg ggc gac gac 624
Glu Gln Leu Gly Ala Gln Arg Ile Phe Glu Leu Gly Leu Gly Asp Asp
195 200 205

gat ggg aac ttg gag gag gac ttc atc acc tgg cga gag cag ttc tgg 672
Asp Gly Asn Leu Glu Glu Asp Phe Ile Thr Trp Arg Glu Gln Phe Trp
210 215 220

ccg gcc gtg tgt gaa cac ttt ggg gtg gaa gcc act ggc gag gag tcc 720
Pro Ala Val Cys Glu His Phe Gly Val Glu Ala Thr Gly Glu Glu Ser
225 230 235 240

agc att cgc cag tac gag ctt gtg gtc cac acc gac ata gat gcg gcc 768
Ser Ile Arg Gln Tyr Glu Leu Val Val His Thr Asp Ile Asp Ala Ala
245 250 255 

aag gtg tac atg ggg gag atg ggc cgg ctg aag agc tac gag aac cag 816
Lys Val Tyr Met Gly Glu Met Gly Arg Leu Lys Ser Tyr Glu Asn Gln
260 265 270

aag ccc ccc ttt gat gcc aag aat ccg ttc ctg gct gca gtc acc acc 864
Lys Pro Pro Phe Asp Ala Lys Asn Pro Phe Leu Ala Ala Val Thr Thr
275 280 285

aac cgg aag ctg aac cag gga acc gag cgc cac ctc atg cac ctg gaa 912
Asn Arg Lys Leu Asn Gln Gly Thr Glu Arg His Leu Met His Leu Glu
290 295 300

ttg gac atc tcg gac tcc aaa atc agg tat gaa tct ggg gac cac gtg 960
Leu Asp Ile Ser Asp Ser Lys Ile Arg Tyr Glu Ser Gly Asp His Val
305 310 315 320

gct gtg tac cca gcc aac gac tct gct ctc gtc aac cag ctg ggc aaa 1008
Ala Val Tyr Pro Ala Asn Asp Ser Ala Leu Val Asn Gln Leu Gly Lys
325 330 335

atc ctg ggt gcc gac ctg gac gtc gtc atg tcc ctg aac aac ctg gat 1056
Ile Leu Gly Ala Asp Leu Asp Val Val Met Ser Leu Asn Asn Leu Asp
340 345 350

gag gag tcc aac aag aag cac cca ttc ccg tgc cct acg tcc tac cgc 1104
Glu Glu Ser Asn Lys Lys His Pro Phe Pro Cys Pro Thr Ser Tyr Arg
355 360 365

acg gcc ctc acc tac tac ctg gac atc acc aac ccg ccg cgt acc aac 1152
Thr Ala Leu Thr Tyr Tyr Leu Asp Ile Thr Asn Pro Pro Arg Thr Asn
370 375 380 



gtg ctg tac gag ctg gcg cag tac gcc tcg gag ccc tcg gag cag gag 1200
Val Leu Tyr Glu Leu Ala Gln Tyr Ala Ser Glu Pro Ser Glu Gln Glu
385 390 395 400



ctg ctg cgc aag atg gcc tcc tcc tcc ggc gag ggc aag gag ctg tac 1248
Leu Leu Arg Lys Met Ala Ser Ser Ser Gly Glu Gly Lys Glu Leu Tyr
405 410 415

ctg agc tgg gtg gtg gag gcc cgg agg cac atc ctg gcc atc ctg cag 1296
Leu Ser Trp Val Val Glu Ala Arg Arg His Ile Leu Ala Ile Leu Gln
420 425 430

gac tgc ccg tcc ctg cgg ccc ccc atc gac cac ctg tgt gag ctg ctg 1344
Asp Cys Pro Ser Leu Arg Pro Pro Ile Asp His Leu Cys Glu Leu Leu
435 440 445

ccg cgc ctg cag gcc cgc tac tac tcc atc gcc tca tcc tcc aag gtc 1392
Pro Arg Leu Gln Ala Arg Tyr Tyr Ser Ile Ala Ser Ser Ser Lys Val
450 455 460

cac ccc aac tct gtg cac atc tgt gcg gtg gtt gtg gag tac gag acc 1440
His Pro Asn Ser Val His Ile Cys Ala Val Val Val Glu Tyr Glu Thr
465 470 475 480

aag gcc ggc cgc atc aac aag ggc gtg gcc acc aac tgg ctg cgg gcc 1488
Lys Ala Gly Arg Ile Asn Lys Gly Val Ala Thr Asn Trp Leu Arg Ala
485 490 495

aag gag cct gcc ggg gag aac ggc ggc cgt gcg ctg gtg ccc atg ttc 1536
Lys Glu Pro Ala Gly Glu Asn Gly Gly Arg Ala Leu Val Pro Met Phe
500 505 510

gtg cgc aag tcc cag ttc cgc ctg ccc ttc aag gcc acc acg cct gtc 1584
Val Arg Lys Ser Gln Phe Arg Leu Pro Phe Lys Ala Thr Thr Pro Val
515 520 525

atc atg gtg ggc ccc ggc acc ggg gtg gca ccc ttc ata ggc ttc atc 1632
Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Ile Gly Phe Ile
530 535 540

cag gag cgg gcc tgg ctg cga cag cag ggc aag gag gtg ggg gag acg 1680
Gln Glu Arg Ala Trp Leu Arg Gln Gln Gly Lys Glu Val Gly Glu Thr
545 550 555 560 

ctg ctg tac tac ggc tgc cgc cgc tcg gat gag gac tac ctg tac cgg 1728
Leu Leu Tyr Tyr Gly Cys Arg Arg Ser Asp Glu Asp Tyr Leu Tyr Arg
565 570 575

gag gac gtg gcg cag ttc cac agg gac ggt gcg ctc acc cag ctc aac 1776
Glu Asp Val Ala Gln Phe His Arg Asp Gly Ala Leu Thr Gln Leu Asn
580 585 590

gtg gcc ttc tcc cgg gag cag tcc cac aag gtc tac gtc cag cac ctg 1824
Val Ala Phe Ser Arg Glu Gln Ser His Lys Val Tyr Val Gln His Leu
595 600 605

cta aag caa gac cga gag cac ctg tgg aag ttg atc gaa ggc ggt gcc 1872
Leu Lys Gln Asp Arg Glu His Leu Trp Lys Leu Ile Glu Gly Gly Ala
610 615 620

cac atc tac gtc tgt ggg gat gca cgg aac atg gcc agg gat gtg cag 1920
His Ile Tyr Val Cys Gly Asp Ala Arg Asn Met Ala Arg Asp Val Gln
625 630 635 640

aac acc ttc tac gac atc gtg gct gag ctc ggg gcc atg gag cac gcg 1968
Asn Thr Phe Tyr Asp Ile Val Ala Glu Leu Gly Ala Met Glu His Ala
645 650 655

cag gcg gtg gac tac atc aag aaa ctg atg acc aag ggc cgc tac tcc 2016
Gln Ala Val Asp Tyr Ile Lys Lys Leu Met Thr Lys Gly Arg Tyr Ser
660 665 670

ctg gac gtg tgg agc tag 2034
Leu Asp Val Trp Ser 
675 



<210> 14 
<211> 677 
<212> PRT
<213> Human cells
<400> 14 

Met Ala Asp Ser His Val Asp Thr Ser Ser Thr Val Ser Glu Ala Val 
1 5 10 15

Ala Glu Glu Val Ser Leu Phe Ser Met Thr Asp Met Ile Leu Phe Ser
20 25 30

Leu Ile Val Gly Leu Leu Thr Tyr Trp Phe Leu Phe Arg Lys Lys Lys
35 40 45

Glu Glu Val Pro Glu Phe Thr Lys Ile Gln Thr Leu Thr Ser Ser Val
50 55 60

Arg Glu Ser Ser Phe Val Glu Lys Met Lys Lys Thr Gly Arg Asn Ile
65 70 75 80

Ile Val Phe Tyr Gly Ser Gln Thr Gly Thr Ala Glu Glu Phe Ala Asn
85 90 95

Arg Leu Ser Lys Asp Ala His Arg Tyr Gly Met Arg Gly Met Ser Ala
100 105 110

Asp Pro Glu Glu Tyr Asp Leu Ala Asp Leu Ser Ser Leu Pro Glu Ile
115 120 125

Asp Asn Ala Leu Val Val Phe Cys Met Ala Thr Tyr Gly Glu Gly Asp
130 135 140

Pro Thr Asp Asn Ala Gln Asp Phe Tyr Asp Trp Leu Gln Glu Thr Asp
145 150 155 160 

Val Asp Leu Ser Gly Val Lys Phe Ala Val Phe Gly Leu Gly Asn Lys 
165 170 175 



Thr Tyr Glu His Phe Asn Ala Met Gly Lys Tyr Val Asp Lys Arg Leu
180 185 190

Glu Gln Leu Gly Ala Gln Arg Ile Phe Glu Leu Gly Leu Gly Asp Asp
195 200 205

Asp Gly Asn Leu Glu Glu Asp Phe Ile Thr Trp Arg Glu Gln Phe Trp
210 215 220

Pro Ala Val Cys Glu His Phe Gly Val Glu Ala Thr Gly Glu Glu Ser
225 230 235 240

Ser Ile Arg Gln Tyr Glu Leu Val Val His Thr Asp Ile Asp Ala Ala
245 250 255

Lys Val Tyr Met Gly Glu Met Gly Arg Leu Lys Ser Tyr Glu Asn Gln
260 265 270

Lys Pro Pro Phe Asp Ala Lys Asn Pro Phe Leu Ala Ala Val Thr Thr
275 280 285

Asn Arg Lys Leu Asn Gln Gly Thr Glu Arg His Leu Met His Leu Glu
290 295 300

Leu Asp Ile Ser Asp Ser Lys Ile Arg Tyr Glu Ser Gly Asp His Val
305 310 315 320

Ala Val Tyr Pro Ala Asn Asp Ser Ala Leu Val Asn Gln Leu Gly Lys
325 330 335

Ile Leu Gly Ala Asp Leu Asp Val Val Met Ser Leu Asn Asn Leu Asp
340 345 350

Glu Glu Ser Asn Lys Lys His Pro Phe Pro Cys Pro Thr Ser Tyr Arg
355 360 365

Thr Ala Leu Thr Tyr Tyr Leu Asp Ile Thr Asn Pro Pro Arg Thr Asn
370 375 380 



Val Leu Tyr Glu Leu Ala Gln Tyr Ala Ser Glu Pro Ser Glu Gln Glu
385 390 395 400

Leu Leu Arg Lys Met Ala Ser Ser Ser Gly Glu Gly Lys Glu Leu Tyr
405 410 415

Leu Ser Trp Val Val Glu Ala Arg Arg His Ile Leu Ala Ile Leu Gln
420 425 430

Asp Cys Pro Ser Leu Arg Pro Pro Ile Asp His Leu Cys Glu Leu Leu
435 440 445

Pro Arg Leu Gln Ala Arg Tyr Tyr Ser Ile Ala Ser Ser Ser Lys Val
450 455 460

His Pro Asn Ser Val His Ile Cys Ala Val Val Val Glu Tyr Glu Thr
465 470 475 480

Lys Ala Gly Arg Ile Asn Lys Gly Val Ala Thr Asn Trp Leu Arg Ala
485 490 495

Lys Glu Pro Ala Gly Glu Asn Gly Gly Arg Ala Leu Val Pro Met Phe
500 505 510

Val Arg Lys Ser Gln Phe Arg Leu Pro Phe Lys Ala Thr Thr Pro Val
515 520 525

Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Ile Gly Phe Ile
530 535 540

Gln Glu Arg Ala Trp Leu Arg Gln Gln Gly Lys Glu Val Gly Glu Thr
545 550 555 560

Leu Leu Tyr Tyr Gly Cys Arg Arg Ser Asp Glu Asp Tyr Leu Tyr Arg
565 570 575

Glu Asp Val Ala Gln Phe His Arg Asp Gly Ala Leu Thr Gln Leu Asn
580 585 590

Val Ala Phe Ser Arg Glu Gln Ser His Lys Val Tyr Val Gln His Leu
595 600 605

Leu Lys Gln Asp Arg Glu His Leu Trp Lys Leu Ile Glu Gly Gly Ala
610 615 620

His Ile Tyr Val Cys Gly Asp Ala Arg Asn Met Ala Arg Asp Val Gln
625 630 635 640

Asn Thr Phe Tyr Asp Ile Val Ala Glu Leu Gly Ala Met Glu His Ala
645 650 655

Gln Ala Val Asp Tyr Ile Lys Lys Leu Met Thr Lys Gly Arg Tyr Ser
660 665 670

Leu Asp Val Trp Ser
675



<210> 15 
<211> 2034 
<212> DNA 
<213> Human cells 

<220> 

<221> CDS 

<222> (1) . . (2031) 

<400> 15 

atg gga gac tcc cac gtg gac acc agc tcc acc gtg tcc gag gcg gtg 48
Met Gly Asp Ser His Val Asp Thr Ser Ser Thr Val Ser Glu Ala Val
1 5 10 15

gcc gaa gaa gta tct ctt ttc agc atg acg gac atg att ctg ttt tcg 96
Ala Glu Glu Val Ser Leu Phe Ser Met Thr Asp Met Ile Leu Phe Ser
20 25 30

ctc atc gtg ggt ctc cta acc tac tgg ttc ctc ttc aga aag aaa aaa 144
Leu Ile Val Gly Leu Leu Thr Tyr Trp Phe Leu Phe Arg Lys Lys Lys
35 40 45

gaa gaa gtc ccc gag ttc acc aaa att cag aca ttg acc tcc tct gtc 192
Glu Glu Val Pro Glu Phe Thr Lys Ile Gln Thr Leu Thr Ser Ser Val
50 55 60

aga gag agc agc ttt gtg gaa aag atg aag aaa acg ggg agg aac atc 240
Arg Glu Ser Ser Phe Val Glu Lys Met Lys Lys Thr Gly Arg Asn Ile
65 70 75 80

atc gtg ttc tac ggc tcc cag acg ggg act gca gag gag ttt gcc aac 288
Ile Val Phe Tyr Gly Ser Gln Thr Gly Thr Ala Glu Glu Phe Ala Asn
85 90 95

cgc ctg tcc aag gac gcc cac cgc tac ggg atg cga ggc atg tca gcg 336
Arg Leu Ser Lys Asp Ala His Arg Tyr Gly Met Arg Gly Met Ser Ala
100 105 110

gac cct gag gag tat gac ctg gcc gac ctg agc agc ctg cca gag atc 384
Asp Pro Glu Glu Tyr Asp Leu Ala Asp Leu Ser Ser Leu Pro Glu Ile
115 120 125

gac aac gcc ctg gtg gtt ttc tgc atg gcc acc tac ggt gag gga gac 432
Asp Asn Ala Leu Val Val Phe Cys Met Ala Thr Tyr Gly Glu Gly Asp
130 135 140

ccc acc gac aat gcc cag gac ttc tac gac tgg ctg cag gag aca gac 480
Pro Thr Asp Asn Ala Gln Asp Phe Tyr Asp Trp Leu Gln Glu Thr Asp
145 150 155 160

gtg gat ctc tct ggg gtc aag ttc gcg gtg ttt ggt ctt ggg aac aag 528
Val Asp Leu Ser Gly Val Lys Phe Ala Val Phe Gly Leu Gly Asn Lys
165 170 175

acc tac gag cac ttc aat gcc atg ggc aag tac gtg gac aag cgg ctg 576
Thr Tyr Glu His Phe Asn Ala Met Gly Lys Tyr Val Asp Lys Arg Leu
180 185 190

gag cag ctc ggc gcc cag cgc atc ttt gag ctg ggg ttg ggc gac gac 624
Glu Gln Leu Gly Ala Gln Arg Ile Phe Glu Leu Gly Leu Gly Asp Asp
195 200 205
gat ggg aac ttg gag gag gac ttc atc acc tgg cga gag cag ttc tgg 672
Asp Gly Asn Leu Glu Glu Asp Phe Ile Thr Trp Arg Glu Gln Phe Trp
210 215 220

ccg gcc gtg tgt gaa cac ttt ggg gtg gaa gcc act ggc gag gag tcc 720
Pro Ala Val Cys Glu His Phe Gly Val Glu Ala Thr Gly Glu Glu Ser
225 230 235 240

agc att cgc cag tac gag ctt gtg gtc cac acc gac ata gat gcg gcc 768
Ser Ile Arg Gln Tyr Glu Leu Val Val His Thr Asp Ile Asp Ala Ala
245 250 255

aag gtg tac atg ggg gag atg ggc cgg ctg aag agc tac gag aac cag 816
Lys Val Tyr Met Gly Glu Met Gly Arg Leu Lys Ser Tyr Glu Asn Gln
260 265 270

aag ccc ccc ttt gat gcc aag aat ccg ttc ctg gct gca gtc acc acc 864
Lys Pro Pro Phe Asp Ala Lys Asn Pro Phe Leu Ala Ala Val Thr Thr
275 280 285

aac cgg aag ctg aac cag gga acc gag cgc cac ctc atg cac ctg gaa 912
Asn Arg Lys Leu Asn Gln Gly Thr Glu Arg His Leu Met His Leu Glu
290 295 300

ttg gac atc tcg gac tcc aaa atc agg tat gaa tct ggg gac cac gtg 960
Leu Asp Ile Ser Asp Ser Lys Ile Arg Tyr Glu Ser Gly Asp His Val
305 310 315 320

gct gtg tac cca gcc aac gac tct gct ctc gtc aac cag ctg ggc aaa 1008
Ala Val Tyr Pro Ala Asn Asp Ser Ala Leu Val Asn Gln Leu Gly Lys
325 330 335

atc ctg ggt gcc gac ctg gac gtc gtc atg tcc ctg aac aac ctg gat 1056
Ile Leu Gly Ala Asp Leu Asp Val Val Met Ser Leu Asn Asn Leu Asp
340 345 350

gag gag tcc aac aag aag cac cca ttc ccg tgc cct acg tcc tac cgc 1104
Glu Glu Ser Asn Lys Lys His Pro Phe Pro Cys Pro Thr Ser Tyr Arg
355 360 365

acg gcc ctc acc tac tac ctg gac atc acc aac ccg ccg cgt acc aac 1152
Thr Ala Leu Thr Tyr Tyr Leu Asp Ile Thr Asn Pro Pro Arg Thr Asn
370 375 380

gtg ctg tac gag ctg gcg cag tac gcc tcg gag ccc tcg gag cag gag 1200
Val Leu Tyr Glu Leu Ala Gln Tyr Ala Ser Glu Pro Ser Glu Gln Glu
385 390 395 400

ctg ctg cgc aag atg gcc tcc tcc tcc ggc gag ggc aag gag ctg tac 1248
Leu Leu Arg Lys Met Ala Ser Ser Ser Gly Glu Gly Lys Glu Leu Tyr
405 410 415

ctg agc tgg gtg gtg gag gcc cgg agg cac atc ctg gcc atc ctg cag 1296
Leu Ser Trp Val Val Glu Ala Arg Arg His Ile Leu Ala Ile Leu Gln
420 425 430

gac tgc ccg tcc ctg cgg ccc ccc atc gac cac ctg tgt gag ctg ctg 1344
Asp Cys Pro Ser Leu Arg Pro Pro Ile Asp His Leu Cys Glu Leu Leu
435 440 445

ccg cgc ctg cag gcc cgc tac tac tcc atc gcc tca tcc tcc aag gtc 1392
Pro Arg Leu Gln Ala Arg Tyr Tyr Ser Ile Ala Ser Ser Ser Lys Val
450 455 460

cac ccc aac tct gtg cac atc tgt gcg gtg gtt gtg gag tac gag acc 1440
His Pro Asn Ser Val His Ile Cys Ala Val Val Val Glu Tyr Glu Thr
465 470 475 480

aag gcc ggc cgc atc aac aag ggc gtg gcc acc aac tgg ctg cgg gcc 1488
Lys Ala Gly Arg Ile Asn Lys Gly Val Ala Thr Asn Trp Leu Arg Ala
485 490 495

aag gag cct gcc ggg gag aac ggc ggc cgt gcg ctg gtg ccc atg ttc 1536
Lys Glu Pro Ala Gly Glu Asn Gly Gly Arg Ala Leu Val Pro Met Phe
500 505 510



gtg cgc aag tcc cag ttc cgc ctg ccc ttc aag gcc acc acg cct gtc 1584
Val Arg Lys Ser Gln Phe Arg Leu Pro Phe Lys Ala Thr Thr Pro Val
515 520 525

atc atg gtg ggc ccc ggc acc ggg gtg gca ccc ttc ata ggc ttc atc 1632
Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Ile Gly Phe Ile
530 535 540

cag gag cgg gcc tgg ctg cga cag cag ggc aag gag gtg ggg gag acg 1680
Gln Glu Arg Ala Trp Leu Arg Gln Gln Gly Lys Glu Val Gly Glu Thr
545 550 555 560

ctg ctg tac tac ggc tgc cgc cgc tcg gat gag gac tac ctg tac cgg 1728
Leu Leu Tyr Tyr Gly Cys Arg Arg Ser Asp Glu Asp Tyr Leu Tyr Arg
565 570 575

gag gac gtg gcg cag ttc cac agg gac ggt gcg ctc acc cag ctc aac 1776
Glu Asp Val Ala Gln Phe His Arg Asp Gly Ala Leu Thr Gln Leu Asn
580 585 590

gtg gcc ttc tcc cgg gag cag tcc cac aag gtc tac gtc cag cac ctg 1824
Val Ala Phe Ser Arg Glu Gln Ser His Lys Val Tyr Val Gln His Leu
595 600 605

cta aag caa gac cga gag cac ctg tgg aag ttg atc gaa ggc ggt gcc 1872
Leu Lys Gln Asp Arg Glu His Leu Trp Lys Leu Ile Glu Gly Gly Ala
610 615 620

cac atc tac gtc tgt ggg gat gca cgg aac atg gcc agg gat gtg cag 1920
His Ile Tyr Val Cys Gly Asp Ala Arg Asn Met Ala Arg Asp Val Gln
625 630 635 640

aac acc ttc tac gac atc gtg gct gag ctc ggg gcc atg gag cac gcg 1968
Asn Thr Phe Tyr Asp Ile Val Ala Glu Leu Gly Ala Met Glu His Ala
645 650 655

cag gcg gtg gac tac atc aag aaa ctg atg acc aag ggc cgc tac tcc 2016
Gln Ala Val Asp Tyr Ile Lys Lys Leu Met Thr Lys Gly Arg Tyr Ser
660 665 670

ctg gac gtg tgg agc tag 2034
Leu Asp Val Trp Ser
675



<210> 16 

<211> 677 

<212> PRT 

<213> Human cells 

<400> 16 

Met Gly Asp Ser His Val Asp Thr Ser Ser Thr Val Ser Glu Ala Val
1 5 10 15

Ala Glu Glu Val Ser Leu Phe Ser Met Thr Asp Met Ile Leu Phe Ser
20 25 30

Leu Ile Val Gly Leu Leu Thr Tyr Trp Phe Leu Phe Arg Lys Lys Lys
35 40 45

Glu Glu Val Pro Glu Phe Thr Lys Ile Gln Thr Leu Thr Ser Ser Val
50 55 60

Arg Glu Ser Ser Phe Val Glu Lys Met Lys Lys Thr Gly Arg Asn Ile
65 70 75 80

Ile Val Phe Tyr Gly Ser Gln Thr Gly Thr Ala Glu Glu Phe Ala Asn
85 90 95

Arg Leu Ser Lys Asp Ala His Arg Tyr Gly Met Arg Gly Met Ser Ala
100 105 110

Asp Pro Glu Glu Tyr Asp Leu Ala Asp Leu Ser Ser Leu Pro Glu Ile
115 120 125

Asp Asn Ala Leu Val Val Phe Cys Met Ala Thr Tyr Gly Glu Gly Asp
130 135 140

Pro Thr Asp Asn Ala Gln Asp Phe Tyr Asp Trp Leu Gln Glu Thr Asp
145 150 155 160

Val Asp Leu Ser Gly Val Lys Phe Ala Val Phe Gly Leu Gly Asn Lys
165 170 175

Thr Tyr Glu His Phe Asn Ala Met Gly Lys Tyr Val Asp Lys Arg Leu
180 185 190

Glu Gln Leu Gly Ala Gln Arg Ile Phe Glu Leu Gly Leu Gly Asp Asp
195 200 205

Asp Gly Asn Leu Glu Glu Asp Phe Ile Thr Trp Arg Glu Gln Phe Trp
210 215 220

Pro Ala Val Cys Glu His Phe Gly Val Glu Ala Thr Gly Glu Glu Ser
225 230 235 240

Ser Ile Arg Gln Tyr Glu Leu Val Val His Thr Asp Ile Asp Ala Ala
245 250 255

Lys Val Tyr Met Gly Glu Met Gly Arg Leu Lys Ser Tyr Glu Asn Gln
260 265 270

Lys Pro Pro Phe Asp Ala Lys Asn Pro Phe Leu Ala Ala Val Thr Thr
275 280 285

Asn Arg Lys Leu Asn Gln Gly Thr Glu Arg His Leu Met His Leu Glu
290 295 300

Leu Asp Ile Ser Asp Ser Lys Ile Arg Tyr Glu Ser Gly Asp His Val
305 310 315 320

Ala Val Tyr Pro Ala Asn Asp Ser Ala Leu Val Asn Gln Leu Gly Lys
325 330 335

Ile Leu Gly Ala Asp Leu Asp Val Val Met Ser Leu Asn Asn Leu Asp
340 345 350

Glu Glu Ser Asn Lys Lys His Pro Phe Pro Cys Pro Thr Ser Tyr Arg
355 360 365

Thr Ala Leu Thr Tyr Tyr Leu Asp Ile Thr Asn Pro Pro Arg Thr Asn
370 375 380

Val Leu Tyr Glu Leu Ala Gln Tyr Ala Ser Glu Pro Ser Glu Gln Glu
385 390 395 400

Leu Leu Arg Lys Met Ala Ser Ser Ser Gly Glu Gly Lys Glu Leu Tyr
405 410 415

Leu Ser Trp Val Val Glu Ala Arg Arg His Ile Leu Ala Ile Leu Gln
420 425 430

Asp Cys Pro Ser Leu Arg Pro Pro Ile Asp His Leu Cys Glu Leu Leu
435 440 445

Pro Arg Leu Gln Ala Arg Tyr Tyr Ser Ile Ala Ser Ser Ser Lys Val
450 455 460

His Pro Asn Ser Val His Ile Cys Ala Val Val Val Glu Tyr Glu Thr
465 470 475 480

Lys Ala Gly Arg Ile Asn Lys Gly Val Ala Thr Asn Trp Leu Arg Ala
485 490 495

Lys Glu Pro Ala Gly Glu Asn Gly Gly Arg Ala Leu Val Pro Met Phe
500 505 510

Val Arg Lys Ser Gln Phe Arg Leu Pro Phe Lys Ala Thr Thr Pro Val
515 520 525

Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Ile Gly Phe Ile
530 535 540

Gln Glu Arg Ala Trp Leu Arg Gln Gln Gly Lys Glu Val Gly Glu Thr
545 550 555 560

Leu Leu Tyr Tyr Gly Cys Arg Arg Ser Asp Glu Asp Tyr Leu Tyr Arg
565 570 575

Glu Asp Val Ala Gln Phe His Arg Asp Gly Ala Leu Thr Gln Leu Asn
580 585 590

Val Ala Phe Ser Arg Glu Gln Ser His Lys Val Tyr Val Gln His Leu
595 600 605

Leu Lys Gln Asp Arg Glu His Leu Trp Lys Leu Ile Glu Gly Gly Ala
610 615 620

His Ile Tyr Val Cys Gly Asp Ala Arg Asn Met Ala Arg Asp Val Gln
625 630 635 640

Asn Thr Phe Tyr Asp Ile Val Ala Glu Leu Gly Ala Met Glu His Ala
645 650 655

Gln Ala Val Asp Tyr Ile Lys Lys Leu Met Thr Lys Gly Arg Tyr Ser
660 665 670

Leu Asp Val Trp Ser
675



<210> 17 
<211> 8537 
<212> DNA 
<213> Human cells 

<400> 17 

atggctgctt caggaatgtt attagtggcc ttgctggtct gcctgactgt gatggtcttg 60
atgtctgttt ggcagcagag gaagagcaag gggaagctgc ctccgggacc caccccattg 120
cccttcattg gaaactacct gcagctgaac acagagcaga tgtacaactc cctcatgaag 180
atcagtgagc gctatggccc cgtgttcacc attcacttgg ggccccggcg ggtcgtggtg 240
ctgtgtggac atgatgccgt cagggaggct ctggtggacc aggctgagga gttcagcggg 300
cgaggcgagc aagccacctt cgactgggtc ttcaaaggct atggcgtggt attcagcaac 360
ggggagcgcg ccaagcagct ccggcgcttc tccatcgcca ccctgcggga cttcggggtg 420
ggcaagcgag gcatcgagga gcgcatccag gaggaggcgg gcttcctcat cgacgccctc 480
cggggcactg gcggcgccaa tatcgatccc accttcttcc tgagccgcac agtctccaat 540
gtcatcagct ccattgtctt tggggaccgc tttgactata aggacaaaga gttcctgtca 600 
ctgttgcgca tgatgctagg aatcttccag ttcacgtcaa cctccacggg gcagctctat 660 
gagatgttct cttcggtgat gaaacacctg ccaggaccac agcaacaggc ctttcagttg 720 
ctgcaagggc tggaggactt catagccaag aaggtggagc acaaccagcg cacgctggat 780 
cccaattccc cacgggactt cattgactcc tttctcatcc gcatgcagga ggaggagaag 840 
aaccccaaca cggagttcta cttgaaaaac ctggtgatga ccacgttgaa cctcttcatt 900 
gggggcaccg agaccgtcag caccaccctg cgctatggct tcttgctgct catgaagcac 960 
ccagaggtgg aggccaaggt ccatgaggag attgacagag tgatcggcaa gaaccggcag 1020 
cccaagtttg aggaccgggc caagatgccc tacatggagg cagtgatcca cgagatccaa 1080 
agatttggag acgtgatccc catgagtttg gcccgcagag tcaaaaagga caccaagttt 1140 
cgggatttct tcctccctaa gggcaccgaa gtgtacccta tgctgggctc tgtgctgaga 1200
gaccccagtt tcttctccaa cccccaggac ttcaatcccc agcacttcct gaatgagaag 1260
gggcagttta agaagagtga tgcttttgtg cccttttcca tcggaaagcg gaactgtttc 1320 
ggagaaggcc tggccagaat ggagctcttt ctcttcttca ccaccgtcat gcagaacttc 1380 
cgcctcaagt cctcccagtc acctaaggac attgacgtgt cccccaaaca cgtgggcttt 1440
gccacgatcc cacgaaacta caccatgagc ttcctgcccc gctgagtcga cctgcagccc 1500 
aagcttccgg tctagaaata attttgttta actttaagaa ggagatataa atatggctga 1560 
ctcccacgtg gacaccagct ccaccgtgtc cgaggcggtg gccgaagaag tatctctttt 1620 
cagcatgacg gacatgattc tgttttcgct catcgtgggt ctcctaacct actggttcct 1680 
cttcagaaag aaaaaagaag aagtccccga gttcaccaaa attcagacat tgacctcctc 1740
tgtcagagag agcagctttg tggaaaagat gaagaaaacg gggaggaaca tcatcgtgtt 1800 
ctacggctcc cagacgggga ctgcagagga gtttgccaac cgcctgtcca aggacgccca 1860 
ccgctacggg atgcgaggca tgtcagcgga ccctgaggag tatgacctgg ccgacctgag 1920 
cagcctgcca gagatcgaca acgccctggt ggttttctgc atggccacct acggtgaggg 1980 
agaccccacc gacaatgccc aggacttcta cgactggctg caggagacag acgtggatct 2040
ctctggggtc aagttcgcgg tgtttggtct tgggaacaag acctacgagc acttcaatgc 2100 
catgggcaag tacgtggaca agcggctgga gcagctcggc gcccagcgca tctttgagct 2160 
ggggttgggc gacgacgatg ggaacttgga ggaggacttc atcacctggc gagagcagtt 2220
ctggccggcc gtgtgtgaac actttggggt ggaagccact ggcgaggagt ccagcattcg 2280 
ccagtacgag cttgtggtcc acaccgacat agatgcggcc aaggtgtaca tgggggagat 2340 
gggccggctg aagagctacg agaaccagaa gccccccttt gatgccaaga atccgttcct 2400
ggctgcagtc accaccaacc ggaagctgaa ccagggaacc gagcgccacc tcatgcacct 2460 
ggaattggac atctcggact ccaaaatcag gtatgaatct ggggaccacg tggctgtgta 2520
cccagccaac gactctgctc tcgtcaacca gctgggcaaa atcctgggtg ccgacctgga 2580 
cgtcgtcatg tccctgaaca acctggatga ggagtccaac aagaagcacc cattcccgtg 2640 
ccctacgtcc taccgcacgg ccctcaccta ctacctggac atcaccaacc cgccgcgtac 2700 
caacgtgctg tacgagctgg cgcagtacgc ctcggagccc tcggagcagg agctgctgcg 2760 
caagatggcc tcctcctccg gcgagggcaa ggagctgtac ctgagctggg tggtggaggc 2820 
ccggaggcac atcctggcca tcctgcagga
cctgtgtgag ctgctgccgc gcctgcaggc 
ggtccacccc aactctgtgc acatctgtgc 
ccgcatcaac aagggcgtgg ccaccaactg 
cggcggccgt gcgctggtgc ccatgttcgt 
ggccaccacg cctgtcatca tggtgggccc 
catccaggag cgggcctggc tgcgacagca 
ctacggctgc cgccgctcgg atgaggacta 
cagggacggt gcgctcaccc agctcaacgt 
ctacgtccag cacctgctaa agcaagaccg 
tgcccacatc tacgtctgtg gggatgcacg 
ctacgacatc gtggctgagc tcggggccat 
gaaactgatg accaagggcc gctactccct 
taagctgtca aacatgagca gatctgagcc 
tgcttgaaga cgaaagggcc tcgtgatacg 
aatggtttct tagcgtcaaa gcaaccatag 
gcgggtgtgg tggttacgcg cagcgtgacc 
cctttcgctt tcttcccttc ctttctcgcc 
aatcgggggc tccctttagg gttccgattt 
cttgatttgg gtgatggttc acgtagtggg 
ttgacgttgg agtccacgtt ctttaatagt 
aaccctatct cgggctattc ttttgattta 
ttaaaaaatg agctgattta acaaaaattt 
acaatttcag gtggcacttt tcggggaaat 
taaatacatt caaatatgta tccgctcatg 
tattgaaaaa ggaagagtat gagtattcaa 
gcggcatttt gccttcctgt ttttgctcac 
gaagatcagt tgggtgcacg agtgggttac 
cttgagagtt ttcgccccga agaacgtttt 
tgtggcgcgg tattatcccg tgttgacgcc 
tattctcaga atgacttggt tgagtactca 
ctgcccgtcc
aacgcgaatt
catttccgtg
ccagaaacgc
atcgaactgg
ccaatgatga gcacttttaa
4 RfiO

gggcaagagc
aactcggtcg
ccagtcacag
aaaagcatct
uduyyduyyu.
4 RO

atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 4740
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 4800
gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 4860
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 4920
gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 4980
gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 5040
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 5100
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 5160
atcgctgaga


taggtgcctc 


actgattaag 


cattggtaac 


uy ttay ctv-Ccl 






tatatacttt 


agattgattt 


aaaacttcat 


ttttaattta 


cictciy y a C C t a 


y y ty clcly cL u C 




ctttttgata atctcatgac 


caaaatccct 


taacgtgagt 


tttcgttcca 


#"•4- i"T a r% <■» ♦» a 

ctgagcyLCa 




gaccccgtag 


aaaagatcaa 


aggatcttct 


tgagatcctt 


tttttctgcg 






tgcttgcaaa 


caaaaaaacc 


accgctacca gcggtggttt 


gtttgccgga 


L. Lady ay C Ua 


JlDU 


ccaactcttt 


ttccgaaggt 


aactggcttc 


agcagagcgc 


agataccaaa 






ctagtgtagc 


cgtagttagg 


ccaccacttc 


aagaactctg 


tagcaccgcc 




JJOU 


gctctgctaa 


tcctgttacc 


agtggctgct 


gccagtggcg 


ataagtcgtg 


t~ a rennet 

lul Lauuyyy 




ttggacccaa 


gacgatagtt 


accggataag gcgcagcggt 


cgggctgaac 


ggyygg ttc g 


^7on 

J / u w 


tgcacacagc 


ccagcttgga 


gcgaacgacc tacaccgaac tgagatacct 


a r* a rtrnf nan 
auaywy uyay 


^7£0 


ctatgagaaa gcgccacgct 


tcccgaaggg 


agaaaggcgg 


acaggtatcc 


ggcaayegge 


^82 0 


agggtcggaa 


caggagagcg 


cacgagggag 


cttccagggg gaaacgcctg 


ytdLCCLCdC 


C Q Q n 
JOOU 


agtcctgtcg ggtttcgcca 


cctctgactt 


gagcgtcgat 


ttttgtgatg 


ctcy LCdggg 


O -7 ft V 


gggcggagcc 


tatggaaaaa 


cgccagcaac 


gcggcctttt 


tacggttcct 


ggcctLLcgc 


ouuu 


tggccttttg 


ctcacatgtt 


ctttcctgcg 


ttatcccctg 


attctgtgga 


haaprnt' a t* t~ 
LaaLLvjLalL 


£O£0 

Ouuu 


accgcctttg 


agtgagctga 


taccgctcgc 


cgcagccgaa 


cgaccgagcg 


Cagcydyuca 


D ± Z U 


gtgagcgagg 


aagcggaaga 


gcgcctgatg 


cggtattttc 


tccttacgca 


tctgtgcggL 


f 1 Qft 
DXOU 


atttcacacc 


gcatatatgg 


tgcactctca 


gtacaatctg 


ctctgatgcc 


ri a ^ a ^ a a 

yCatayLCad 


£Od n 


gccagtatac 


actccgctat 


cgctacgtga 


ctgggtcatg gctgcgcccc 


^ra a ^ nnn /™» 
yaCaLLLy LL 


conn 


aacacccgct 


gacgcgccct 


gacgggcttg 


tctgctcccg gcatccgctt 


a r* a <™i a /■» a a ^* 
aCcigdCadgC 


DJDU 


tgtgaccgtc 


tccgggagct 


gcatgtgtca gaggttttca 


ccgtcatcac 


cgaaacgcgc 


Oft z u 


gaggcagaac 


gccatcaaaa 


ataattcgcg 


tctggccttc 


ctgtagccag 


cttt catcaa 


o4 oU 


cattaaatgt 


gagcgagtaa 


caacccgtcg gattctccgt 


gggaacaaac 


ggcggattga 


dd4 U 


ccgtaatggg 


ataggttacg 


ttggtgtaga 


tgggcgcatc 


gtaaccgtgc 


acctgccagL 


DO uu 


ttgaggggac 


gacgacagta 


tcggcctcag gaagatcgca 


ctccagccag 


cut LucygLd 


DOOu 


ccgcttctgg 


tgccggaaac 


caggcaaagc 


gccattcgcc 


attcaggctg 


cgcadCLgcc 


D / Z U 


gggaagggcg 


atcggtgcgg 


gcctcttcgc 


tattacgcca 


gctggcgaaa 


ggg9g ac 9 c g 


o / o u 


ctgcaaggcg 


attaagttgg 


gtaacgccag ggttttccca gtcacgacgt 


4— ^t+- a a a a /"»/~» a 
tgtddddtgd 


O Oft u 


cggccagtga atccgtaatc 


atggtcatag 


ctgtttcctg 


tgtgaaattg 


4- ^ ^ ^ ^ /^t ^ 4" 

LLdUCCyCtC 




acaattccac 


acaacatacg 


agccggaagc 


ataaagtgta 


aagcctgggg 


CyCCLddtgd 


£Q£0 


gtgagctaac 


tcacattaat 


tgcgttgcgc 


tcactgcccg 


ctttccagtc 


y yy dddCCLy 




tcgtgccagc 


tgcattaatg 


aatcggccaa 


cgcgcgggga 


gaggcggttt 


gcgtattggg 


7080 


cgccagggtg 


gtttttcttt 


tcaccagtga gacgggcaac 


agctgattgc 


ccttcaccgc 


7140 


ctggccctga 


gagagttgca 


gcaagcggtc 


cacgctggtt 


tgccccagca 


ggegaaaate 


7200 


ctgtttgatg gtggttaacg 


gcgggatata 


acatgagctg 


tcttcggtat 


cgtcgtatcc 


7260 


cactaccgag 


atatccgcac 


caacgcgcag 


cccggactcg gtaatggcgc 


gcattgcgcc 


7320 


cagcgccatc 


tgatcgttgg 


caaccagcat 


cgcagtggga 


acgatgccct 


cattcagcat 


7380 


ttgcatggtt 


tgttgaaaac 


cggacatggc 


actccagtcg 


ccttcccgtt 


ccgctatcgg 


7440 


ctgaatttga 


ttgcgagtga 


gatatttatg 


ccagccagcc 


agacgcagac 


gcgccgagac 


7500 
agaacttaat gggcccgcta acagcgcgat ttgctggtga cccaatgcga ccagatgctc 7560
cacgcccagt cgcgtaccgt cttcatggga gaaaataata ctgttgatgg gtgtctggtc 7620
agagacatca agaaataacg ccggaacatt agtgcaggca gcttccacag caatggcatc 7680
ctggtcatcc agcggatagt taatgatcag cccactgacg cgttgcgcga gaagattgtg 7740
caccgccgct ttacaggctt cgacgccgct tcgttctacc atcgacacca ccacgctggc 7800
acccagttga tcggcgcgag atttaatcgc cgcgacaatt tgcgacggcg cgtgcagggc 7860
cagactggag gtggcaacgc caatcagcaa cgactgtttg cccgccagtt gttgtgccac 7920
gcggttggga atgtaattca gctccgccat cgccgcttcc actttttccc gcgttttcgc 7980
agaaacgtgg ctggcctggt tcaccacgcg ggaaacggtc tgataagaga caccggcata 8040
ctctgcgaca tcgtataacg ttactggttt cacattcacc accctgaatt gactctcttc 8100
cgggcgctat catgccatac cgcgaaaggt tttgcgccat tcgatggtgt cctggcacga 8160
caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga gttagctcac 8220
tcattaggca ccccaggctt tacactttat gcttccggct cgtataatgt gtggaattgt 8280
gagcggataa caatttcaca caggaaacag gatcgatcca tcgatgagct tactccccat 8340
ccccctgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag cggataacaa 8400
tttcacacag gaaacaggat cagcttactc cccatccccc tgttgacaat taatcatcgg 8460
ctcgtataat gtgtggaatt gtgagcggat aacaatttca cacaggaaac aggatccatc 8520
gatgcttagg aggtcat 8537



<210> 18 

<211> 8492 

<212> DNA 

<213> Human cells 



<400> 18 

atggctcgtc aagttcattc ttcttggaat ctgcccccag gccctttccc gcttcccatc 60
atcgggaacc tcttccagtt ggaattgaag aatattccca agtccttcac ccggttggcc 120
cagcgcttcg ggccggtgtt cacgctgtac gtgggctcgc agcgcatggt ggtgatgcac 180
ggctacaagg cggtgaagga agcgctgctg gactacaagg acgagttctc gggcagaggc 240
gacctccccg cgttccatgc gcacagggac aggggaatca tttttaataa tggacctacc 300
tggaaggaca tccggcggtt ttccctgacc accctccgga actatgggat ggggaaacag 360
ggcaatgaga gccggatcca gagggaggcc cacttcctgc tggaagcact caggaagacc 420
caaggccagc ctttcgaccc caccttcctc atcggctgcg cgccctgcaa cgtcatagcc 480
gacatcctct tccgcaagca ttttgactac aatgatgaga agtttctaag gctgatgtat 540
ttgtttaatg agaacttcca cctactcagc actccctggc tccagcttta caataatttt 600
cccagctttc tacactactt gcctggaagc cacagaaaag tcataaaaaa tgtggctgaa 660
gtaaaagagt atgtgtctga aagggtgaag gagcaccatc aatctctgga ccccaactgt 720
ccccgggacc tcaccgactg cctgctcgtg gaaatggaga aggaaaagca cagtgcagag 780
cgcttgtaca caatggacgg tatcaccgtg
gagaccacca gcacaactct gagatatggg 
gaagagaagc tccatgaaga aattgacagg 
aaggataggc aagagatgcc ctacatggat 
accctcgtgc cctccaacct gccccatgaa 
ctcatcccca agggcacagt cgtagtgcca 
gaatttcctg atccagaaaa gtttaagcca 
aagtacagtg actatttcaa gccattttcc 
ctggctcgca tggagttgtt tcttttgttg 
cctctcgttg acccaaagga tatcgacctc 
ccaccacgtt acaaactctg tgtcattccc 
ccccccgctt caaacaagct tccggtctag 
tataaatatg gctgactccc acgtggacac 
agaagtatct cttttcagca tgacggacat 
aacctactgg ttcctcttca gaaagaaaaa 
gacattgacc tcctctgtca gagagagcag 
gaacatcatc gtgttctacg gctcccagac 
gtccaaggac gcccaccgct acgggatgcg 
cctggccgac ctgagcagcc tgccagagat 
cacctacggt gagggagacc ccaccgacaa 
gacagacgtg gatctctctg gggtcaagtt 
cgagcacttc aatgccatgg gcaagtacgt 
gcgcatcttt gagctggggt tgggcgacga 
ctggcgagag cagttctggc cggccgtgtg 
ggagtccagc attcgccagt acgagcttgt 
gtacatgggg gagatgggcc ggctgaagag 
caagaatccg ttcctggctg cagtcaccac 
ccacctcatg cacctggaat tggacatctc 
ccacgtggct gtgtacccag ccaacgactc 
gggtgccgac ctggacgtcg tcatgtccct 
gcacccattc ccgtgcccta cgtcctaccg 
caacccgccg cgtaccaacg tgctgtacga 
gcaggagctg ctgcgcaaga tggcctcctc 
ctgggtggtg gaggcccgga ggcacatcct 
gccccccatc gaccacctgt gtgagctgct 
cgcctcatcc tccaaggtcc accccaactc 
cgagaccaag gccggccgca tcaacaaggg 
gcctgccggg gagaacggcg gccgtgcgct 
ccgcctgccc ttcaaggcca ccacgcctgt 
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—5 /-^ 4-» *T 4» /"I /""T /** f*t 

accgcggccy 


acctyucctt 


tgeggggaca 


Oft U 


*m 4- /h 4» M 4— 4- M 

ctccty atcc 


ucatgaaaca 


ccctgagatc 


on n 
yuu 


gtgacugggc 


caagccgaac 


ccctgccat c 




gcugcggcgc 


acgagac tea 


gcggtccacc 


i no n 


gcaacccgag 


acaccactLL 


cagaggatac 


1 non 
IUO U 


actctggact 


ctgttttgta 


cgacaaccaa 


1 J.fi U 


gaacacttcc 


tgaatgaaaa 


tggaaag ttc 


iftftft 
l^UU 


acaggaaaac 


gagtgtgtgc 


eggagaagge 


1 TCft 
1Z D U 


tgtgccattt 


tgcagcattt 


f- a l- ♦* /~T a ^ r^f 
tdaiL uy clay 




agccctatac 


atattgggtt 


cy y c cy Cci uc 




cgctcatgag 


tgtgtggagg 


ttCaCCCtyaa 


i 4 4 n 


aaataatttt 


gtttaacttt 


aagaaggaga 


i C ft ft 


cagctccacc gtgtccgagg 


cggcggccga 


1 ccn 
lb bU 


gattctgttt 


tcgctcatcg 


4— +^ ****** 4* /« <4» m a 4» 

tgggccxcct 


icon 


agaagaagtc 


cccgagttca 


ccaaaacuca 


1/TQA 

loflU 


ctttgtggaa 


aagatgaaga 


aaaeggggag 


1 *7 A ft 


ggggactgca 


gaggagtttg 


ccaaccgcct 


1 Q ft ft 


aggcatgtca 


gcggaccctg 


aggagtatga 


1 0 £ ft 


cgacaacgcc 


ctggtggttt 


4^ ^ 4» 4» ***** n 

LCCyCdLygc 


1 QTft 


tgcccaggac 


ttctacgact 


****** *+ 4> mm ^ /<« 

ggccgcagga 


1 Q Q ft 


cgcggtgttt 


ggtcttggga 


acaagaccta 


O ft A ft 

ZU4U 


ggacaagcgg 


ctggagcagc 


tcggcgccca 


n 1 ft ft 


cgatgggaac 


ttggaggagg 


acttcatcac 


0 1 C ft 


tgaacacttt 


ggggtggaag 


ccactggcga 




ggtccacacc 


gacatagatg 


eggecaaggt 


O *> O ft 


ctacgagaac 


cagaagcccc 


cc tttgatgc 


O "3 yi ft 
Z J 4 U 


caaccggaag 


ctgaaccagg 


gaaccgagcg 


/i ft ft 


ggactccaaa 


atcaggtatg 


aatctgggga 


O A C ft 

z4 bU 


tgctctcgtc 


aaccagctgg 


gcaaaatccc 




gaacaacctg gatgaggagt 


ccaacaagaa 


neon 


cacggccctc 


acctactacc 


tggacatcac 


2o4U 


gctggcgcag 


tacgcctcgg 


agccctcgga 


2700 


ctccggcgag 


ggcaaggagc 


tgtacctgag 


2760 


ggccatcctg 


caggactgcc 


cgtccctgcg 


2820 


gccgcgcctg 


caggcccgct 


actactccat 


2880 


tgtgcacatc 


tgtgcggtgg 


ttgtggagta 


2940 


cgtggccacc 


aactggctgc 


gggccaagga 


3000 


ggtgcccatg 


ttcgtgcgca 


agtcccagtt 


3060 


catcatggtg ggccccggca 


ccggggtggc 


3120 
acccttcata ggcttcatcc aggagcgggc
ggagacgctg ctgtactacg gctgccgccg 
cgtggcgcag ttccacaggg acggtgcgct 
gcagtcccac aaggtctacg tccagcacct 
gttgatcgaa ggcggtgccc acatctacgt 
tgtgcagaac accttctacg acatcgtggc 
ggtggactac atcaagaaac tgatgaccaa 
gaagcttatc gatgataagc tgtcaaacat 
cttttttttc agatctgctt gaagacgaaa 
taatgtcatg ataataatgg tttcttagcg 
ggcgcattaa gcgcggcggg tgtggtggtt 
gccctagcgc ccgctccttt cgctttcttc 
ccccgtcaag ctctaaatcg ggggctccct 
ctcgacccca aaaaacttga tttgggtgat 
acggtttttc gccctttgac gttggagtcc 
actggaacaa cactcaaccc tatctcgggc 
atttcggcct attggttaaa aaatgagctg 
aaaatattaa cgtttacaat ttcaggtggc 
atttgtttat ttttctaaat acattcaaat 
taaatgcttc aataatattg aaaaaggaag 
cttattccct tttttgcggc attttgcctt 
aaagtaaaag atgctgaaga tcagttgggt 
aacagcggta agatccttga gagttttcgc 
tttaaagttc tgctatgtgg cgcggtatta 
ggtcgccgca tacactattc tcagaatgac 
cacctcacgg atggcatgac agtaagagaa 
aacactgcgg ccaacttact tctgacaacg 
ttgcacaaca tgggggatca tgcaactcgc 
gccataccaa acgacgagcg tgacaccacg 
aaactattaa ctggcgaact acttactcta 
gaggcggata aagttgcagg accacttctg 
gctgataaat ctggagccgg tgagcgtggg 
gatggtaagc cctcccgtat cgtagttatc 
gaacgaaata gacagatcgc tgagataggt 
gaccaagttt actcatatat actttagatt 
atctaggtga agatcctttt tgataatctc 
ttccactgag cgtcagaccc cgtagaaaag 
ctgcgcgtaa tctgctgctt gcaaacaaaa 
ccggatcaag agctaccaac tctttttccg 
ctggctgcga


cagcagggca 


aggaggtggg 


JlOU 


ctcggatgag 


gactacctgt 


accgggagga 


jz4 U 


cacccagctc 


aacgtggcct 


ccccccggga 




gctaaagcaa gaccgagagc 


acc ugnggaa 


J JO U 


ctgtggggat gcacggaaca 


cggccaggga 




tgagctcggg gccatggagc 


acgcgcaggc 


a q n 


gggccgctac 


tccctggacg 


tgtggagcca 


i c a ft 


gagcagatct 


gagcccgcct 


ac* tyciy cyyy 




gggcctcgtg 


atacgcctat 


u t. u ua udy y u 


JDOv 


tcaaagcaac 


catagtacgc 






acgcgcagcg 


tgaccgctac 


dCLLyCtayC 


^ 7fln 

J / O u 


ccttcctttc 


tcgccacgtt 


cgccggccxc 


1 OA ft 


ttagggttcc 


gatttagtgc 


ccxacggcac 


i on n 

J 27 U u 


ggttcacgta gtgggccatc 


gccc uy a cay 


ocn 

J i7D u 


acgttcttta 


atagtggact 


C t ty t tCCaa 


a no ft 


tattcttttg 


atttataagg 


gacxccg ccy 


A ft A ft 


atttaacaaa 


aatttaacgc 


gaauu ucaac 




acttttcggg gaaatgtgcg 


cggaacccct 


A "5 ft ft 


atgtatccgc 


tcatgagaca 


acaaccc eg a 




agtatgagta 


ttcaacattt 


ccytgLCycc 


A O ft 
4 J Z U 


cctgtttttg ctcacccaga 


^ ^ ^ ft ft ^ 

aacgeuggeg 


A 1 0 ft 


gcacgagtgg gttacatcga 


actggatctc 


4 44 U 


cccgaagaac 


gttttccaat 


gatgagcact 


4 bU u 


tcccgtgttg 


acgccgggca 


ag age a act c 


a c: c r\ 
4 o b U 


ttggttgagt 


actcaccagt 


cacayaaaag 


A CO ft 


ttatgcagtg 


ctgccataac 


catgagtgat 


/I CQ A 

4 fa oU 


atcggaggac 


cgaaggagct 


^ j-m n m 4- W A- ^- 

aaccgcLLLL 


A *7 A ft 
4 /4U 


cttgatcgtc 


gggaaccgga 


gc cgaacgaa 


A Q ft ft 

4 oUU 


atgcctgcag caatggcaac 


aacgttgcgc 


A O £ ft 
4 ooU 


gcttcccggc 


aacaattaat 


agactggatg 


4 yzu 


cgctcggccc 


ttccggctgg 


cnggcnuacE 


x con 
4 y oU 


tctcgcggta tcattgcagc actggggcca 5040
tacacgacgg ggagtcaggc aactatggat 5100
gcctcactga ttaagcattg gtaactgtca 5160
gatttaaaac ttcattttta atttaaaagg 5220
atgaccaaaa tcccttaacg tgagttttcg 5280
atcaaaggat cttcttgaga tccttttttt 5340
aaaccaccgc taccagcggt ggtttgtttg 5400
aaggtaactg gcttcagcag agcgcagata 5460
ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca 5520
ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag 5580
tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc 5640
tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga 5700
tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg 5760
tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac 5820
gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg 5880
tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg 5940
ttcctggcct tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct 6000
gtggataacc gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc 6060
gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc tgatgcggta ttttctcctt 6120
acgcatctgt gcggtatttc acaccgcata tatggtgcac tctcagtaca atctgctctg 6180
atgccgcata gttaagccag tatacactcc gctatcgcta cgtgactggg tcatggctgc 6240
gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc 6300
cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc 6360
atcaccgaaa cgcgcgaggc agaacgccat caaaaataat tcgcgtctgg ccttcctgta 6420
gccagctttc atcaacatta aatgtgagcg agtaacaacc cgtcggattc tccgtgggaa 6480
caaacggcgg attgaccgta atgggatagg ttacgttggt gtagatgggc gcatcgtaac 6540
cgtgcatctg ccagtttgag gggacgacga cagtatcggc ctcaggaaga tcgcactcca 6600
gccagctttc cggcaccgct tctggtgccg gaaaccaggc aaagcgccat tcgccattca 6660
ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagctgg 6720
cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt tcccagtcac 6780
gacgttgtaa aacgacggcc agtgaatccg taatcatggt catagctgtt tcctgtgtga 6840
aattgttatc cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc 6900
tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc 6960
cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc 7020
ggtttgcgta ttgggcgcca gggtggtttt tcttttcacc agtgagacgg gcaacagctg 7080
attgcccttc accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc 7140
cagcaggcga aaatcctgtt tgatggtggt taacggcggg atataacatg agctgtcttc 7200
ggtatcgtcg tatcccacta ccgagatatc cgcaccaacg cgcagcccgg actcggtaat 7260
ggcgcgcatt gcgcccagcg ccatctgatc gttggcaacc agcatcgcag tgggaacgat 7320
gccctcattc agcatttgca tggtttgttg aaaaccggac atggcactcc agtcgccttc 7380
ccgttccgct atcggctgaa tttgattgcg agtgagatat ttatgccagc cagccagacg 7440
cagacgcgcc gagacagaac ttaatgggcc cgctaacagc gcgatttgct ggtgacccaa 7500
tgcgaccaga tgctccacgc ccagtcgcgt accgtcttca tgggagaaaa taatactgtt 7560
gatgggtgtc tggtcagaga catcaagaaa taacgccgga acattagtgc aggcagcttc 7620
cacagcaatg gcatcctggt catccagcgg atagttaatg atcagcccac tgacgcgttg 7680
cgcgagaaga ttgtgcaccg ccgctttaca ggcttcgacg ccgcttcgtt ctaccatcga 7740
caccaccacg ctggcaccca gttgatcggc gcgagattta atcgccgcga caatttgcga 7800
cggcgcgtgc agggccagac tggaggtggc aacgccaatc agcaacgact gtttgcccgc 7860
cagttgttgt gccacgcggt tgggaatgta attcagctcc gccatcgccg cttccacttt 7920
ttcccgcgtt ttcgcagaaa cgtggctggc ctggttcacc acgcgggaaa cggtctgata 7980
agagacaccg gcatactctg cgacatcgta taacgttact ggtttcacat tcaccaccct 8040
gaattgactc tcttccgggc gctatcatgc cataccgcga aaggttttgc gccattcgat 8100
ggtgtcctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 8160
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 8220
aatgtgtgga attgtgagcg gataacaatt tcacacagga aacaggatcg atccatcgat 8280
gagcttactc cccatccccc tgttgacaat taatcatcgg ctcgtataat gtgtggaatt 8340
gtgagcggat aacaatttca cacaggaaac aggatcagct tactccccat ccccctgttg 8400
acaattaatc atcggctcgt ataatgtgtg gaattgtgag cggataacaa tttcacacag 8460
gaaacaggat ccatcgatgc ttaggaggtc at 8492



<210> 19 

<211> 8475 

<212> DNA 

<213> Human cells 



<400> 19 



atggctcgac aatcttctgg acgaggaaaa ctccctcctg gccctactcc tctcccagtg 60
attggaaata tcctacagat agatattaag gatgtcagca aatccttaac caatctctca 120
aaaatctatg gccctgtgtt cactctgtat tttggcctgg aacgcatggt ggtgctgcat 180
ggatatgaag tggtgaagga agccctgatt gatcttggag aggagttttc tggaagaggc 240
catttcccac tggctgaaag agctaacaga ggatttggaa tcgttttcag caatggaaag 300
agatggaagg agatccggcg tttctccctc atgacgctgc ggaattttgg gatggggaag 360
aggagcattg aggaccgtgt tcaagaggaa gcccgctgcc ttgtggagga gttgagaaaa 420
accaaggctt caccctgtga tcccactttc atcctgggct gtgctccctg caatgtgatc 480
tgctccatta ttttccagaa acgtttcgat tataaagatc agcaatttct taacttgatg 540
gaaaaattga atgaaaacat caggattgta agcaccccct ggatccagat atgcaataat 600
tttcccacta tcattgatta tttcccggga acccataaca aattacttaa aaaccttgct 660
tttatggaaa gtgatatttt ggagaaagta aaagaacacc aagaatcgat ggacatcaac 720
aaccctcggg actttattga ttgcttcctg atcaaaatgg agaaggaaaa gcaaaaccaa 780
cagtctgaat tcactattga aaacttggta atcactgcag ctgacttact tggagctggg 840
acagagacaa caagcacaac cctgagatat gctctccttc tcctgctgaa gcacccagag 900
gtcacagcta aagtccagga agagattgaa cgtgtcgttg gcagaaaccg gagcccctgc 960
atgcaggaca ggggccacat gccctacaca gatgctgtgg tgcacgaggt ccagagatac 1020
atcgacctca tccccaccag cctgccccat gcagtgacct gtgacgttaa attcagaaac 1080


tacctcattc ccaagggcac aaccatatta acttccctca cttctgtgct acatgacaac 1140
aaagaatttc ccaacccaga gatgtttgac cctcgtcact ttctggatga aggtggaaat 1200
tttaagaaaa gtaactactt catgcctttc tcagcaggaa aacggatttg tgtgggagag 1260
ggcctggccc gcatggagct gtttttattc ctgaccttca ttttacagaa ctttaacctg 1320
aaatctctga ttgacccaaa ggaccttgac acaactcctg ttgtcaatgg atttgcttct 1380
gtcccgccct tctatcagct gtgcttcatt cctgtctgaa gaagcacaga tggtctggaa 1440
gcttccggtc tagaaataat tttgtttaac tttaagaagg agatataaat atggctgact 1500
cccacgtgga caccagctcc accgtgtccg aggcggtggc cgaagaagta tctcttttca 1560
gcatgacgga catgattctg ttttcgctca tcgtgggtct cctaacctac tggttcctct 1620
tcagaaagaa aaaagaagaa gtccccgagt tcaccaaaat tcagacattg acctcctctg 1680
tcagagagag cagctttgtg gaaaagatga agaaaacggg gaggaacatc atcgtgttct 1740
acggctccca gacggggact gcagaggagt ttgccaaccg cctgtccaag gacgcccacc 1800
gctacgggat gcgaggcatg tcagcggacc ctgaggagta tgacctggcc gacctgagca 1860
gcctgccaga gatcgacaac gccctggtgg ttttctgcat ggccacctac ggtgagggag 1920
accccaccga caatgcccag gacttctacg actggctgca ggagacagac gtggatctct 1980
ctggggtcaa gttcgcggtg tttggtcttg ggaacaagac ctacgagcac ttcaatgcca 2040
tgggcaagta cgtggacaag cggctggagc agctcggcgc ccagcgcatc tttgagctgg 2100
ggttgggcga cgacgatggg aacttggagg aggacttcat cacctggcga gagcagttct 2160
ggccggccgt gtgtgaacac tttggggtgg aagccactgg cgaggagtcc agcattcgcc 2220
agtacgagct tgtggtccac accgacatag atgcggccaa ggtgtacatg ggggagatgg 2280
gccggctgaa gagctacgag aaccagaagc ccccctttga tgccaagaat ccgttcctgg 2340
ctgcagtcac caccaaccgg aagctgaacc agggaaccga gcgccacctc atgcacctgg 2400
aattggacat ctcggactcc aaaatcaggt atgaatctgg ggaccacgtg gctgtgtacc 2460
cagccaacga ctctgctctc gtcaaccagc tgggcaaaat cctgggtgcc gacctggacg 2520
tcgtcatgtc cctgaacaac ctggatgagg agtccaacaa gaagcaccca ttcccgtgcc 2580
ctacgtccta ccgcacggcc ctcacctact acctggacat caccaacccg ccgcgtacca 2640
acgtgctgta cgagctggcg cagtacgcct cggagccctc ggagcaggag ctgctgcgca 2700
agatggcctc ctcctccggc gagggcaagg agctgtacct gagctgggtg gtggaggccc 2760
ggaggcacat cctggccatc ctgcaggact gcccgtccct gcggcccccc atcgaccacc 2820
tgtgtgagct gctgccgcgc ctgcaggccc gctactactc catcgcctca tcctccaagg 2880
tccaccccaa ctctgtgcac atctgtgcgg tggttgtgga gtacgagacc aaggccggcc 2940
gcatcaacaa gggcgtggcc accaactggc tgcgggccaa ggagcctgcc ggggagaacg 3000
gcggccgtgc gctggtgccc atgttcgtgc gcaagtccca gttccgcctg cccttcaagg 3060
ccaccacgcc tgtcatcatg gtgggccccg gcaccggggt ggcacccttc ataggcttca 3120
tccaggagcg ggcctggctg cgacagcagg gcaaggaggt gggggagacg ctgctgtact 3180
acggctgccg ccgctcggat gaggactacc tgtaccggga ggacgtggcg cagttccaca 3240
gggacggtgc gctcacccag ctcaacgtgg ccttctcccg ggagcagtcc cacaaggtct 3300
acgtccagca cctgctaaag caagaccgag agcacctgtg gaagttgatc gaaggcggtg 3360
cccacatcta cgtctgtggg gatgcacgga acatggccag ggatgtgcag aacaccttct 3420
acgacatcgt ggctgagctc ggggccatgg agcacgcgca ggcggtggac tacatcaaga 3480

ctgtccttct
catacctcgc
aagtcgtgtc ttaccgggtt 5580
ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg 5640
cacacagccc agcttggagc gaacgaccta agatacctac agcgtgagct 5700
atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg taagcggcag 5760


ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt atctttatag 5820
tcctgtcggg tttcgccacc tctgacttga
gcggagccta tggaaaaacg ccagcaacgc
gccttttgct cacatgttct ttcctgcgtt
cgcctttgag tgagctgata ccgctcgccg
gagcgaggaa gcggaagagc gcctgatgcg
ttcacaccgc atatatggtg cactctcagt
cagtatacac tccgctatcg ctacgtgact
cacccgctga cgcgccctga cgggcttgtc
tgaccgtctc cgggagctgc atgtgtcaga
ggcagaacgc catcaaaaat aattcgcgtc
ttaaatgtga gcgagtaaca acccgtcgga
gtaatgggat aggttacgtt ggtgtagatg
gaggggacga cgacagtatc ggcctcagga agatcgcact ccagccagct ttccggcacc 6600
gcttctggtg ccggaaacca ggcaaagcgc cattcgccat tcaggctgcg caactgttgg 6660
gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg gggatgtgct 6720
gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg taaaacgacg 6780
gccagtgaat ccgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgctcac 6840
aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt 6900
gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc 6960
gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg 7020
ccagggtggt ttttcttttc accagtgaga cgggcaacag ctgattgccc ttcaccgcct 7080
ggccctgaga gagttgcagc aagcggtcca cgctggtttg ccccagcagg cgaaaatcct 7140
gtttgatggt ggttaacggc gggatataac atgagctgtc ttcggtatcg tcgtatccca 7200
ctaccgagat atccgcacca acgcgcagcc cggactcggt aatggcgcgc attgcgccca 7260
gcgccatctg atcgttggca accagcatcg cagtgggaac gatgccctca ttcagcattt 7320
gcatggtttg ttgaaaaccg gacatggcac tccagtcgcc ttcccgttcc gctatcggct 7380
gaatttgatt gcgagtgaga tatttatgcc agccagccag acgcagacgc gccgagacag 7440
aacttaatgg gcccgctaac agcgcgattt gctggtgacc caatgcgacc agatgctcca 7500
cgcccagtcg cgtaccgtct tcatgggaga aaataatact gttgatgggt gtctggtcag 7560
agacatcaag aaataacgcc ggaacattag tgcaggcagc ttccacagca atggcatcct 7620
ggtcatccag cggatagtta atgatcagcc cactgacgcg ttgcgcgaga agattgtgca 7680
ccgccgcttt acaggcttcg acgccgcttc gttctaccat cgacaccacc acgctggcac 7740
ccagttgatc ggcgcgagat ttaatcgccg cgacaatttg cgacggcgcg tgcagggcca 7800
gactggaggt ggcaacgcca atcagcaacg actgtttgcc cgccagttgt tgtgccacgc 7860
ggttgggaat gtaattcagc tccgccatcg ccgcttccac tttttcccgc gttttcgcag 7920
aaacgtggct ggcctggttc accacgcggg aaacggtctg ataagagaca ccggcatact 7980
ctgcgacatc gtataacgtt actggtttca cattcaccac cctgaattga ctctcttccg 8040
ggcgctatca tgccataccg cgaaaggttt tgcgccattc gatggtgtcc tggcacgaca 8100
ggtttcccga ctggaaagcg ggcagtgagc gcaacgcaat taatgtgagt tagctcactc 8160
attaggcacc ccaggcttta cactttatgc ttccggctcg tataatgtgt ggaattgtga 8220
gcggataaca atttcacaca ggaaacagga tcgatccatc gatgagctta ctccccatcc 8280
ccctgttgac aattaatcat cggctcgtat aatgtgtgga attgtgagcg gataacaatt 8340
tcacacagga aacaggatca gcttactccc catccccctg ttgacaatta atcatcggct 8400
cgtataatgt gtggaattgt gagcggataa caatttcaca caggaaacag gatccatcga 8460
tgcttaggag gtcat 8475



<210> 20 

<211> 1215 

<212> DNA 

<213> Human cells 



<220> 

<221> CDS 

<222> (1)..(1212)



<400> 20 

atg act gcg aca gtc gct tcg aca agt ctg ttc acc acc gcc gac cac 48
Met Thr Ala Thr Val Ala Ser Thr Ser Leu Phe Thr Thr Ala Asp His
1 5 10 15



tac cac act ccg ttg ggt ccc gac ggc acg cca cac gca ttc ttc gag 96 
Tyr His Thr Pro Leu Gly Pro Asp Gly Thr Pro His Ala Phe Phe Glu 
20 25 30 



gcg ttg cgc gac gag gcg gag acc acc cct att gga tgg agt gag gcc 144
Ala Leu Arg Asp Glu Ala Glu Thr Thr Pro Ile Gly Trp Ser Glu Ala
35 40 45

tac ggt ggc cac tgg gta gtt gcc gga tac aag gag att cag gct gtc 192
Tyr Gly Gly His Trp Val Val Ala Gly Tyr Lys Glu Ile Gln Ala Val
50 55 60

atc cag aac acc aag gcc ttc tcg aac aag ggc gtg acc ttc cct cgc 240
Ile Gln Asn Thr Lys Ala Phe Ser Asn Lys Gly Val Thr Phe Pro Arg
65 70 75 80

tac gaa acc ggc gag ttc gag ctc atg atg gca ggg cag gac gac ccc 288
Tyr Glu Thr Gly Glu Phe Glu Leu Met Met Ala Gly Gln Asp Asp Pro
85 90 95

gtc cac aag aag tac cgc cag ctc gtg gcg aaa ccg ttc tcg ccg gag 336
Val His Lys Lys Tyr Arg Gln Leu Val Ala Lys Pro Phe Ser Pro Glu
100 105 110

gcg acc gac ctg ttc acc gaa cag ttg cgc cag tcc acc aac gac ctg 384
Ala Thr Asp Leu Phe Thr Glu Gln Leu Arg Gln Ser Thr Asn Asp Leu
115 120 125

atc gac gcg cgg atc gaa ctc gga gag ggt gac gct gcc acc tgg tta 432
Ile Asp Ala Arg Ile Glu Leu Gly Glu Gly Asp Ala Ala Thr Trp Leu
130 135 140

gcc aac gag att cct gct cgg cta acg gcc atc ctc ctc ggc ctt ccg 480
Ala Asn Glu Ile Pro Ala Arg Leu Thr Ala Ile Leu Leu Gly Leu Pro
145 150 155 160

ccc gaa gac gga gac acc tac cgc cgg tgg gtg tgg gcc atc acg cac 528
Pro Glu Asp Gly Asp Thr Tyr Arg Arg Trp Val Trp Ala Ile Thr His
165 170 175

gta gag aac ccg gag gaa ggc gcc gaa att ttc gcc gaa ctt gtg gcg 576
Val Glu Asn Pro Glu Glu Gly Ala Glu Ile Phe Ala Glu Leu Val Ala
180 185 190

cac gcg cgc act ctg atc gcc gag cgc cgc act aac cca ggc aac gac 624
His Ala Arg Thr Leu Ile Ala Glu Arg Arg Thr Asn Pro Gly Asn Asp
195 200 205

atc atg tct cgg gtc atc atg tcc aag atc gac ggt gaa agc ctc tcc 672
Ile Met Ser Arg Val Ile Met Ser Lys Ile Asp Gly Glu Ser Leu Ser
210 215 220

gag gac gac ctg atc gga ttc ttc acc atc ctg ctt ctc ggc ggc atc 720
Glu Asp Asp Leu Ile Gly Phe Phe Thr Ile Leu Leu Leu Gly Gly Ile
225 230 235 240

gac aac acc gca cgc ttc ctc agc tcg gtg ttc tgg cgc ttg gcg tgg 768
Asp Asn Thr Ala Arg Phe Leu Ser Ser Val Phe Trp Arg Leu Ala Trp
245 250 255

gac atc gag ctc cgc cgc cga ctc atc gcc cac ccc gag ctt atc ccc 816
Asp Ile Glu Leu Arg Arg Arg Leu Ile Ala His Pro Glu Leu Ile Pro
260 265 270

aac gcc gta gac gaa ctg ctc cgc ttc tac ggc ccc gcc atg gtc ggc 864
Asn Ala Val Asp Glu Leu Leu Arg Phe Tyr Gly Pro Ala Met Val Gly
275 280 285

cgc ctc gtc acc caa gag gtc acc gtg ggc gac atc aca atg aag cca 912
Arg Leu Val Thr Gln Glu Val Thr Val Gly Asp Ile Thr Met Lys Pro
290 295 300

ggc caa acg gcc atg ctg tgg ttc ccg atc gcc agc cgg gat cgt tcg 960
Gly Gln Thr Ala Met Leu Trp Phe Pro Ile Ala Ser Arg Asp Arg Ser
305 310 315 320

gca ttc gat tca ccc gac aac atc gtg atc gaa cgc acg ccc aac cgg 1008
Ala Phe Asp Ser Pro Asp Asn Ile Val Ile Glu Arg Thr Pro Asn Arg
325 330 335

cac ctg tct ctc ggc cac ggc atc cac cgc tgc ctc gga gcc cac ctg 1056
His Leu Ser Leu Gly His Gly Ile His Arg Cys Leu Gly Ala His Leu
340 345 350

atc cgc gtc gag gcc cgc gtg gcg atc acc gag ttc ctc aag cgc att 1104
Ile Arg Val Glu Ala Arg Val Ala Ile Thr Glu Phe Leu Lys Arg Ile
355 360 365

ccg gag ttc tcc ctc gac ccg aac aag gaa tgt gag tgg tta atg ggc 1152
Pro Glu Phe Ser Leu Asp Pro Asn Lys Glu Cys Glu Trp Leu Met Gly
370 375 380



cag gtg gcc ggc atg ctg cac gtg ccg atc atc ttc cct aag ggc aag 1200
Gln Val Ala Gly Met Leu His Val Pro Ile Ile Phe Pro Lys Gly Lys
385 390 395 400






cga ctg tcg gaa tga 1215
Arg Leu Ser Glu 



<210> 21 

<211> 404 

<212> PRT 

<213> Human cells 

<400> 21 

Met Thr Ala Thr Val Ala Ser Thr Ser Leu Phe Thr Thr Ala Asp His
1 5 10 15

Tyr His Thr Pro Leu Gly Pro Asp Gly Thr Pro His Ala Phe Phe Glu
20 25 30

Ala Leu Arg Asp Glu Ala Glu Thr Thr Pro Ile Gly Trp Ser Glu Ala
35 40 45

Tyr Gly Gly His Trp Val Val Ala Gly Tyr Lys Glu Ile Gln Ala Val
50 55 60

Ile Gln Asn Thr Lys Ala Phe Ser Asn Lys Gly Val Thr Phe Pro Arg
65 70 75 80

Tyr Glu Thr Gly Glu Phe Glu Leu Met Met Ala Gly Gln Asp Asp Pro
85 90 95

Val His Lys Lys Tyr Arg Gln Leu Val Ala Lys Pro Phe Ser Pro Glu
100 105 110

Ala Thr Asp Leu Phe Thr Glu Gln Leu Arg Gln Ser Thr Asn Asp Leu
115 120 125

Ile Asp Ala Arg Ile Glu Leu Gly Glu Gly Asp Ala Ala Thr Trp Leu
130 135 140



Ala Asn Glu Ile Pro Ala Arg Leu Thr Ala Ile Leu Leu Gly Leu Pro
145 150 155 160



Pro Glu Asp Gly Asp Thr Tyr Arg Arg Trp Val Trp Ala Ile Thr His
165 170 175

Val Glu Asn Pro Glu Glu Gly Ala Glu Ile Phe Ala Glu Leu Val Ala
180 185 190

His Ala Arg Thr Leu Ile Ala Glu Arg Arg Thr Asn Pro Gly Asn Asp
195 200 205

Ile Met Ser Arg Val Ile Met Ser Lys Ile Asp Gly Glu Ser Leu Ser
210 215 220

Glu Asp Asp Leu Ile Gly Phe Phe Thr Ile Leu Leu Leu Gly Gly Ile
225 230 235 240

Asp Asn Thr Ala Arg Phe Leu Ser Ser Val Phe Trp Arg Leu Ala Trp
245 250 255

Asp Ile Glu Leu Arg Arg Arg Leu Ile Ala His Pro Glu Leu Ile Pro
260 265 270

Asn Ala Val Asp Glu Leu Leu Arg Phe Tyr Gly Pro Ala Met Val Gly
275 280 285

Arg Leu Val Thr Gln Glu Val Thr Val Gly Asp Ile Thr Met Lys Pro
290 295 300

Gly Gln Thr Ala Met Leu Trp Phe Pro Ile Ala Ser Arg Asp Arg Ser
305 310 315 320

Ala Phe Asp Ser Pro Asp Asn Ile Val Ile Glu Arg Thr Pro Asn Arg
325 330 335

His Leu Ser Leu Gly His Gly Ile His Arg Cys Leu Gly Ala His Leu
340 345 350

Ile Arg Val Glu Ala Arg Val Ala Ile Thr Glu Phe Leu Lys Arg Ile
355 360 365

Pro Glu Phe Ser Leu Asp Pro Asn Lys Glu Cys Glu Trp Leu Met Gly
370 375 380

Gln Val Ala Gly Met Leu His Val Pro Ile Ile Phe Pro Lys Gly Lys
385 390 395 400

Arg Leu Ser Glu



<210> 23 

<211> 154 

<212> PRT 

<213> Human cells 

<400> 23 

Met Asn Ala Leu Ile Leu Tyr Gly Thr Glu Thr Gly Asn Ala Glu Ala
1 5 10 15

Cys Ala Thr Thr Ile Ser Gln Val Leu Ala Asp Thr Val Asp Thr Lys
20 25 30

Val His Asp Leu Ala Asp Met Thr Pro Arg Ala Met Leu Asp Ser Gly
35 40 45

Ala Asp Leu Ile Val Phe Ala Thr Ala Thr Tyr Gly Glu Gly Glu Phe
50 55 60

Ala Gly Gly Gly Ala Ala Phe Phe Glu Thr Leu Arg Glu Thr Lys Pro
65 70 75 80

Asp Leu Ser Gly Leu Arg Phe Ala Val Phe Gly Leu Gly Asp Ser Tyr
85 90 95

Tyr Thr Thr Phe Asn Gln Ala Gly Ala Thr Ala Ala Thr Ile Leu Ala
100 105 110

Ser Leu Gly Gly Thr Gln Val Gly Asp Thr Ala Arg His Asp Thr Ser
115 120 125

Ser Gly Asp Asp Pro Glu Glu Thr Ala Glu Glu Trp Ala Arg Glu Ile
130 135 140

Leu Thr Ala Leu Ala Thr Pro Ala Val Ser
145 150